CHUBU UNIVERSITY
  • Print only the body of the page. Set to print background colors and images.
  • Print the whole page. Set to print background colors and images.

YAMAMOTO Kazumasa

Profile

Year of Birth 1972
Place of Birth Mimasaka, Okayama
Title Associate Professor
Belong to Dept. of Computer Science
Computer Science (graduate school)
Graduated Toyohashi University of Technology, Graduate School of Engineering
Degree Dr. of Engineering, Toyohashi University of Technology
Academic Institutional Membership Acoustical Society of Japan (ASJ)
The Institute of Electronics, Information and Communication Engineers (IEICE)
Information Processing Society of Japan (IPSJ)
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
International Speech Communication Association (ISCA)
Asia-Pacific Signal and Information Processing Association (APSIPA)
Field of Study Spoken Language Processing, Speech Signal Processing
Research, Studies Natural spoken dialogue system, Robust speech recognition, Acoustic event detection, Speech emotion recognition
Curriculum Specialized Computer Architecture, Exercise for Computer Engineering, Computer Programming

Academic Papers, Critique

"Single-channel dereverberation by feature mapping using cascade neural networks for robust distant speaker identification and speech recognition.", EURASIP Journal on Audio, Speech, and Music Processing, 2014:13 (31 pages), Jan. 2014.

"A Robust/Fast Spoken Term Detection Method based on a Syllable n-gram Index with a Distance Metric.", Speech Communication, Vol.55, No.3, pp.470-485, Mar. 2013.

"Hidden conditional neural fields for continuous phoneme speech recognition.", IEICE Transactions on Information and Systems, Vol.E95-D, No.8, pp.2094-2104, Aug. 2012.

"Improving the readability of ASR results for lectures using multiple hypotheses and sentence-level knowledge.", IEICE Transactions on Information and Systems, Vol.E95-D, No.4, pp.1101-1114, Apr. 2012.

"CENSREC-4: An evaluation framework for distant-talking speech recognition in reverberant environments.", Acoustical Science and Technology, Technical Report, Vol.32, No.5, pp.201-210, Sep. 2011.

"Auditory perception versus automatic estimation of location and orientation of an acoustic source in a real environment.", Acoustical Science and Technology, Vol.31, No.5, pp.309-319, Sep. 2010.

"Speaker recognition by combining MFCC and phase information in noisy conditions.", IEICE Transactions on Information and Systems, Vol.E93-D, No.9, pp.2397-2406, Sep. 2010.

"Distant speech recognition using a microphone array network.", IEICE Transactions on Information and Systems, Vol.E93-D, No.9, pp.2451-2462, Sep. 2010.

"Privacy protection for speech signals.", Procedia - Social and Behavioral Sciences, Vol.2, No.1, pp.153-160, Mar. 2010.

"Privacy protection for speech information.", Journal of Information Assurance and Security (JIAS), Vol.5, No.1, pp.284-292, Jan./Feb. 2010.

"Automatic estimation of position and orientation of an acoustic source by a microphone array network.", Journal of Acoustical Society of America, Vol.126, No.6, pp.3084-3094, Dec. 2009.

"CENSREC-1-C: An evaluation framework for voice activity detection under noisy environments.", Acoustical Science and Technology, Technical Report, Vol.30, No.5, pp.363-371, Sep. 2009.

"Mel-Wiener filter for Mel-LPC based speech recognition.", IEICE Transactions on Information and Systems, Vol.E90-D, No.6, pp.935-942, Jun. 2007.

"Development of an experimental noise annoyance meter.", Acta Acustica united with Acustica, Vol.93, No.1, pp.73-83, Jan./Feb. 2007.

"AURORA-2J: An evaluation framework for Japanese noisy speech recognition.", IEICE Transactions on Information and Systems, Vol.E88-D, No.3, pp.535-544, Mar. 2005.

"Speech recognition under noisy environments using segmental unit input HMM.", Systems and Computers in Japan, Vol.33, No.8, pp.111-120, Jul. 2002.

"Difference of speech rate, inter-phoneme's distance, and likelihood caused by speaking style and relationship among them and recognition performance.", Systems and Computers in Japan, Vol.33, No.7, pp.50-60, Jun. 2002.

"Speech recognition using hidden Markov models based on segmental statistics.", Systems and Computers in Japan, Vol.28, No.7, pp.31-38, Jun. 1997.

Lectures, Symposium, Presentation

International Conference (full paper reviewed)

"Automatic explanation spot estimation method targeted at text and figures in lecture slides.", Proc. INTERSPEECH 2017, pp.2764-2768, Sep. 2017.

"A deep neural network integrated with filterbank learning for speech recognition.", Proc. ICASSP 2017, pp.5480-5484, Mar. 2017.

"Lyric recognition in monophonic singing using pitch-dependent DNN.", Proc. ICASSP 2017, pp.326-330, Mar. 2017.

"Robust lecture speech translation for speech misrecognition and its rescoring effect from multiple candidates.", Proc. ICAICTA 2017, 6 pages, Aug. 2017.

"Investigation of glottal features and annotation procedures for speech emotion recognition.", Proc. APSIPA ASC 2016, 4 pages, Dec. 2016.

"Domain adaptation of a speech translation system for lectures by utilizing frequently appearing parallel phrases in-domain,” Proc. APSIPA ASC 2016, 4 pages, Dec. 2016.

"Effect of sympathetic relation and unsympathetic relation in multi-agent spoken dialogue system.", Proc. ICAICTA 2016, 6 pages, Aug. 2016.

"Speech analysis of sung-speech and lyric recognition in monophonic singing.", Proc. ICASSP 2016, pp.271-275, Mar. 2016.

"Combination of syllable based n-gram search and word search for spoken term detection through spoken queries and IV/OOV classification.", Proc. ASRU 2015, pp.200-206, Dec. 2015.

"Speech recognition for mixed speech and music by NMF using various cost functions and noise adaptive training methods.", Proc. APSIPA ASC 2015, pp.27-30, Dec. 2015.

"Deep neural network based acoustic model using speaker-class information for short time utterance.", Proc. APSIPA ASC 2015, pp.1222-1225, Dec. 2015.

"Robust speech recognition using DNN-HMM acoustic model combining noise-aware training with spectral subtraction.", Proc. INTERSPEECH 2015, pp.2849-2853, Sep. 2015.

"English to Japanese spoken lecture translation system by using DNN-HMM and phrase-based SMT.", Proc. ICAICTA 2015, 6 pages, Aug. 2015.

"Speech recognition based on Itakura-Saito divergence and dynamics / sparseness constraints from mixed sound of speech and music by non-negative matrix factorization.", Proc. INTERSPEECH 2014, pp.2749-2753, Sep. 2014.

"Comparison of syllable-based and phoneme-based DNN-HMM in Japanese speech recognition.", Proc. ICAICTA 2014, pp.249-254, Aug. 2014.

"Single channel dereverberation method in log-Mel spectral domain using limited stereo data for distant speaker identification.", Proc. APSIPA ASC 2013, 4 pages, Oct. 2013.

"Fast NMF based approach and VQ based approach using MFCC distance measure for speech recognition from mixed sound.", Proc. APSIPA ASC 2013, 4 pages, Oct. 2013.

"Development and evaluation of spoken dialog system with one agent and two agents.", Proc. INTERSPEECH 2013, pp.1896-1900, Aug. 2013.

"Chat-like spoken dialog system for a multi-party dialog incorporating two agents and a user.", Proc. iHAI 2013, 8 pages, Aug. 2013.

"Speaker tracking with spherical microphone arrays.", Proc. ICASSP 2013, pp.3981-3985, May 2013.

Awards

2012 IEICE Best Paper Award

Page Top