発表論文一覧 / Publication List

English title papers are written in English.


著書 / Books

  1. Yuki Todo, Ryota Nishimura, Kazumasa Yamamoto, Seiichi Nakagawa, "Development and evaluation of spoken dialog systems with one or two agents through two domains," Chapter in "Text, Speech, and Dialogue (Lecture Notes in Computer Science)," Ivan Habemal, Václav Matoušek (Eds.), Springer, 2013, ISBN:978-3-642-40584-6.
  2. 山本一公, 土屋雅稔, 第8章 "フリーソフトウェアによる演習", 中川聖一編著, "音声言語処理と自然言語処理," コロナ社, 2013, ISBN:978-4-339-02469-2.
  3. Kazumasa Yamamoto, Seiichi Nakagawa, "Evaluation of Privacy Protection Techniques for Speech Signals," Chapter in "Information Processing and Management of Uncertainty in Knowledge-Based Systems. Applications (Communications in Computer and Information Science)," Eyke Hüllermeier, Rudolf Kruse, Frank Hoffmann (Eds.), Springer, 2010, ISBN:978-3-642-14057-0.
  4. Toshihiko Itoh, Shin'ya Yamada, Kazumasa Yamamoto, Kenji Araki, "Prediction of Driving Actions from Driving Signals," Chapter in "In-Vehicle Corpus and Signal Processing for Driver Behavior," Kazuya Takeda, John H.L. Hansen, Hakan Erdogan, Huseyin Abut (Eds.), Springer, 2008, ISBN:978-0-387-79581-2.

解説論文 / Tutorial Papers

  1. 中川聖一, 山本一公, 土屋雅稔, "音声に含まれるプライバシ情報の保護," 人工知能学会誌, Vol.24, No.2, pp.190--195, Mar. 2009.

学会誌論文 / Journal Papers

  1. Seiichi Nakagawa, Keisuke Iwami, Yasuhisa Fujii, Kazumasa Yamamoto, "A robust/fast spoken term detection method based on a syllable n-gram index with a distance metric," Speech Communication, Vol.55, No.3, pp.470--485, Mar. 2013.
  2. Yasuhisa Fujii, Kazumasa Yamamoto, Seiichi Nakagawa, "Hidden conditional neural fields for continuous phoneme speech recognition," IEICE Transactions on Information and Systems, Vol.E95-D, No.8, pp.2094--2104, Aug. 2012.
  3. Yasuhisa Fujii, Kazumasa Yamamoto, Seiichi Nakagawa, "Improving the readability of ASR results for lectures using multiple hypotheses and sentence-level knowledge," IEICE Transactions on Information and Systems, Vol.95-D, No.4, pp.1101--1114, Apr. 2012.
  4. 北岡教英, 矢野浩利, 杉本夏樹, 山本一公, 中川聖一, "複数理解候補の保持と効率性・自然性を考慮した応答生成による誤認識に頑健な音声対話戦略とその評価," 電子情報通信学会論文誌, Vol.J95-D, No.4, pp.982--994, Apr. 2012.
  5. Takahiro Fukumori, Takanobu Nishiura, Masato Nakayama, Yuki Denda, Norihide Kitaoka, Takeshi Yamada, Kazumasa Yamamoto, Satoru Tsuge, Masakiyo Fujimoto, Tetsuya Takiguchi, Chiyomi Miyajima, Satoshi Tamura, Tetsuji Ogawa, Shigeki Matsuda, Shingo Kuroiwa, Kazuya Takeda, Satoshi Nakamura, "CENSREC-4: An evaluation framework for distant-talking speech recognition in reverberant environments," Acoustical Science and Technology, Technical Report, Vol.32, No.5, pp.201--210, Sep. 2011.
  6. Alberto Yoshihiro Nakano, Seiichi Nakagawa, Kazumasa Yamamoto, "Auditory perception versus automatic estimation of location and orientation of an acoustic source in a real environment," Acoustical Science and Technology, Vol.31, No.5, pp.309--319, Sep. 2010.
  7. Longbiao Wang, Kazue Minami, Kazumasa Yamamoto, Seiichi Nakagawa, "Speaker recognition by combining MFCC and phase information in noisy conditions," IEICE Transactions on Information and Systems, Vol.E93-D, No.9, pp.2397--2406, Sep. 2010.
  8. Alberto Yoshihiro Nakano, Seiichi Nakagawa, Kazumasa Yamamoto, "Distant speech recognition using a microphone array network," IEICE Transactions on Information and Systems, Vol.E93-D, No.9, pp.2451--2462, Sep. 2010.
  9. Kazumasa Yamamoto, Masatoshi Tsuchiya, Seiichi Nakagawa, "Privacy protection for speech signals," Procedia - Social and Behavioral Sciences, Vol.2, No.1, pp.153--160, 2010.
  10. 藤井康寿, 山本一公, 北岡教英, 中川聖一, "重要文抽出に基づく講義音声の自動要約," 情報処理学会論文誌, Vol.51, No.3, pp.1094--1106, Mar. 2010.
  11. Kazumasa Yamamoto, Seiichi Nakagawa, "Privacy protection for speech information," Journal of Information Assurance and Security (JIAS), Vol.5, No.1, pp.284--292, 2010.
  12. Alberto Yoshihiro Nakano, Seiichi Nakagawa, Kazumasa Yamamoto, "Automatic estimation of position and orientation of an acoustic source by a microphone array network," Journal of Acoustical Society of America, Vol.126, No.6, pp.3084--3094, Dec. 2009.
  13. Norihide Kitaoka, Takeshi Yamada, Satoru Tsuge, Chiyomi Miyajima, Kazumasa Yamamoto, Takanobu Nishiura, Masato Nakayama, Yuki Denda, Masakiyo Fujimoto, Tetsuya Takiguchi, Satoshi Tamura, Shigeki Matsuda, Tetsuji Ogawa, Shingo Kuroiwa, Kazuya Takeda, Satoshi Nakamura, "CENSREC-1-C: An evaluation framework for voice activity detection under noisy environments," Acoustical Science and Technology, Technical Report, Vol.30, No.5, pp.363--371, Sep. 2009.
  14. 土屋雅稔, 小暮悟, 西崎博光, 太田健吾, 山本一公, 中川聖一, "日本語講義コンテンツコーパスの作成と分析," 情報処理学会論文誌, Vol.50, No.2, pp.448--459, Feb. 2009.
  15. Md. Babul Islam, Kazumasa Yamamoto, Hiroshi Matsumoto, "Mel-Wiener filter for Mel-LPC based speech recognition," IEICE Transactions on Information and Systems, Vol.E90-D, No.6, pp.935--942, Jun. 2007.
  16. Kenji Furihata, Takesaburo Yanagisawa, David K. Asano, Kazumasa Yamamoto, "Development of an experimental noise annoyance meter," Acta Acustica united with Acustica, Vol.93, No.1, pp.73--83, Jan./Feb. 2007.
  17. Satoshi Nakamura, Kazuya Takeda, Kazumasa Yamamoto, Takeshi Yamada, Shingo Kuroiwa, Norihide Kitaoka, Takanobu Nishiura, Akira Sasou, Mitsunori Mizumachi, Chiyomi Miyajima, Masakiyo Fujimoto, Toshiki Endo, "AURORA-2J: An evaluation framework for Japanese noisy speech recognition," IEICE Transactions on Information and Systems, Vol.E88-D, No.3, pp.535--544, Mar. 2005.
  18. 山本一公, 中川聖一, "セグメント単位入力HMMによる雑音環境下での音声認識," 電子情報通信学会論文誌, Vol.J83-D-II, No.12, pp.2526--2535, Dec. 2000.
  19. (Translation of the above paper) Kazumasa Yamamoto, Seiichi Nakagawa, "Speech recognition under noisy environments using segmental unit input HMM," Systems and Computers in Japan, Vol.33, No.8, pp.111--120, Jul. 2002.
  20. 山本一公, 中川聖一, "発話スタイルによる話速・音韻間距離・ゆう度の違いと音声認識性能の関係," 電子情報通信学会論文誌, Vol.J83-D-II, No.11, pp.2438-2447, Nov. 2000.
  21. (Translation of the above paper) Kazumasa Yamamoto, Seiichi Nakagawa, "Difference of speech rate, inter-phoneme's distance, and likelihood caused by speaking style and relationship among them and recognition performance," Systems and Computers in Japan, Vol.33, No.7, pp.50--60, Jun. 2002.
  22. 中川聖一, 花井建豪, 山本一公, 峯松信明, "HMMに基づく音声認識のための音節モデルとtriphoneモデルの比較," 電子情報通信学会論文誌, Vol.J83-D-II, No.6, pp.1412--1421, Jun. 2000.
  23. 中川聖一, 山本一公, "セグメント統計量を用いた隠れマルコフモデルによる音声認識," 電子情報通信学会論文誌, Vol.J79-D-II, No.12, pp.2032--2038, Dec. 1996.
  24. (Translation of the above paper) Seiichi Nakagawa, Kazumasa Yamamoto, "Speech recognition using hidden Markov models based on segmental statistics," Systems and Computers in Japan, Vol.28, No.7, pp.31--38, Jun. 1997.

国際会議 / International Conference Papers

Invited session papers

  1. Kazumasa Yamamoto, Masatoshi Tsuchiya, Seiichi Nakagawa, "Privacy protection for speech signal," Proc. International Conference on Security Camera Network, Privacy Protection and Community Safety 2009 (SPC 2009), Kiryu, Japan, 2009.
  2. Kazumasa Yamamoto, Seiichi Nakagawa, "Privacy protection for speech information," Proc. The Fifth International Conference on Information Assurance and Security (IAS 2009), pp.717--720, Xi'an, China, 2009.

Full paper reviewed

  1. Aditya Arie Nugraha, Kazumasa Yamamoto, Seiichi Nakagawa, "Single channel dereverberation method in log-Mel spectral domain using limited stereo data for distant speaker identification," Proc. 2013 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2013), CD-ROM, Kaohsiung, Taiwan, Oct. 2013. (to appear)
  2. Shoichi Nakano, Kazumasa Yamamoto, Seiichi Nakagawa, "Fast NMF based approach and VQ based approach using MFCC distance measure for speech recognition from mixed sound," Proc. 2013 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2013), CD-ROM, Kaohsiung, Taiwan, Oct. 2013. (to appear)
  3. Yuki Todo, Ryota Nishimura, Kazumasa Yamamoto, Seiichi Nakagawa, "Development and evaluation of spoken dialog system with one agent and two agents," Proc. INTERSPEECH 2013, pp.1896--1900, Lyon, France, Aug. 2013.
  4. Ryota Nishimura, Yuki Todo, Kazumasa Yamamoto, Seiichi Nakagawa, "Chat-like spoken dialog system for a multi-party dialog incorporating two agents and a user," Proc. 1st International Conference on Human-Agent Interaction (iHAI 2013), pp.xxx--xxx, Sapporo, Japan, Aug. 2013.
  5. John McDonough, Kenichi Kumatani, Takayuki Arakawa, Kazumasa Yamamoto, Bhiksha Raj, "Speaker tracking with spherical microphone arrays," Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2013), pp.xxxx--xxxx, Vancouver, Canada, May 2013.
  6. Kenichi Kumatani, Takayuki Arakawa, Kazumasa Yamamoto, John McDonough, Bhiksha Raj, Rita Singh, Ivan Tashev, "Microphone array processing for distant speech recognition: Towards real-world deployment," Proc. 2012 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2012), CD-ROM, Hollywood, USA, Dec. 2012.
  7. Daisuke Enami, Faqiang Zhu, Kazumasa Yamamoto, Seiichi Nakagawa, "Soft-clustering technique for training data in age- and gender-independent speech recognition," Proc. 2012 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2012), CD-ROM, Hollywood, USA, Dec. 2012.
  8. Shoichi Nakano, Kazumasa Yamamoto, Seiichi Nakagawa, "Fast NMF based approach and improved VQ based approach for speech recognition from mixed sound," Proc. 2012 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2012), CD-ROM, Hollywood, USA, Dec. 2012.
  9. Yasuhisa Fujii, Kazumasa Yamamoto, Seiichi Nakagawa, "Deep-hidden conditional neural fields for continuous phoneme speech recognition," Proc. International Workshop on Statistical Machine Learning for Speech Processing, Kyoto, Japan, Mar. 2012.
  10. Kohta Shimada, Kazumasa Yamamoto, Seiichi Nakagawa, "Speaker identification using pseudo pitch synchronized phase information in voiced sound," Proc. 2011 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2011), CD-ROM, Xi'an, China, Oct. 2011.
  11. Shoichi Nakano, Kazumasa Yamamoto, Seiichi Nakagawa, "Speech recognition in mixed sound of speech and music based on vector quantization and non-negative matrix factorization," Proc. INTERSPEECH 2011, pp.1781--1784, Florence, Italy, Aug. 2011.
  12. Yasuhisa Fujii, Kazumasa Yamamoto, Seiichi Nakagawa, "Hidden boosted MMI and hierarchical state posterior feature for automatic speech recognition based on hidden conditional neural fields," Proc. INTERSPEECH 2011, pp.1001--1004, Florence, Italy, Aug. 2011.
  13. Yasuhisa Fujii, Kazumasa Yamamoto, Seiichi Nakagawa, "Automatic speech recognition using hidden conditional neural fields," Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), pp.5036--5039, Prague, Czech, May 2011.
  14. Keisuke Iwami, Yasuhisa Fujii, Kazumasa Yamamoto, Seiichi Nakagawa, "Efficient out-of-vocabulary term detection by n-gram array indices with distance from a syllable lattice," Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), pp.5664--5667, Prague, Czech, May 2011.
  15. Yasuhisa Fujii, Kazumasa Yamamoto, Seiichi Nakagawa, "Large vocabulary speech recognition system: SPOJUS++," Proc. 11th WSEAS International Conference on Multimedia Systems & Signal Processing (MUSP '11), pp.110--118, Venice, Italy, Mar. 2011.
  16. Keisuke Iwami, Yasuhisa Fujii, Kazumasa Yamamoto, Seiichi Nakagawa, "Out-of-vocabulary term detection by n-gram array with distance from continuous syllable recognition results," Proc. IEEE Workshop on Spoken Language Technology (SLT 2010), pp.212--217, Berkeley, USA, Dec. 2010.
  17. Satoshi Tamura, Chiyomi Miyajima, Norihide Kitaoka, Takeshi Yamada, Satoru Tsuge, Tetsuya Takiguchi, Kazumasa Yamamoto, Takanobu Nishiura, Masato Nakayama, Yuki Denda, Masakiyo Fujimoto, Shigeki Matsuda, Tetsuji Ogawa, Shingo Kuroiwa, Kazuya Takeda, Satoshi Nakamura, "CENSREC-1-AV: An audio-visual corpus for noisy bimodal speech recognition," Proc. 2010 International Conference on Auditory and Visual Speech Processing (AVSP 2010), Hakone, Japan, Sep. 2010.
  18. Kazumasa Yamamoto, Eiichi Sueyoshi, Seiichi Nakagawa, "Speech recognition using long-term phase information," Proc. INTERSPEECH 2010, pp.1189--1192, Makuhari, Japan, Sep. 2010.
  19. Yasuhisa Fujii, Kazumasa Yamamoto, Seiichi Nakagawa, "Improving the readability of class lecture ASR results using a confusion network," Proc. INTERSPEECH 2010, pp.3078--3081, Makuhari, Japan, Sep. 2010.
  20. Longbiao Wang, Kazue Minami, Kazumasa Yamamoto, Seiichi Nakagawa, "Speaker identification by combining MFCC and phase information in noisy environments," Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), pp.4502--4505, Dallas, USA, Mar. 2010.
  21. Yasuhisa Fujii, Kazumasa Yamamoto, Seiichi Nakagawa, "Improving the readability of class lecture ASR results using multiple hypotheses," IASTED Signal Processing, Pattern Recognition and Applications (SPPRA 2010), 678-091, pp.89--96, Innsbruck, Austria, Feb. 2010.
  22. Eiichi Sueyoshi, Kazumasa Yamamoto, Seiichi Nakagawa, "Component reduction technique for covariance matrix of multidimensional Gaussian distribution in speech recognition," 2009 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2009), pp.644--647, Sapporo, Japan, Oct. 2009.
  23. Masatoshi Tsuchiya, Satoru Kogure, Hiromitsu Nishizaki, Kazumasa Yamamoto, Seiichi Nakagawa, "Construction and Analysis of Corpus of Japanese Classroom Lecture Speech Contents," 2009 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2009), pp.344--349, Sapporo, Japan, Oct. 2009.
  24. Alberto Yoshihiro Nakano, Seiichi Nakagawa, Kazumasa Yamamoto, "Estimating the position and orientation of an acoustic source with a microphone array network," Proc. INTERSPEECH 2009, pp.1127--1130, Brighton, UK, Sep. 2009.
  25. Alberto Yoshihiro Nakano, Kazumasa Yamamoto, Seiichi Nakagawa, "Directional acoustic source's position and orientation estimation approach by a microphone array network," Proc. IEEE 13th Digital Signal Processing Workshop and 5th IEEE Signal Processing Education Workshop, pp.606--611, Marco Island, USA, Jan. 2009.
  26. Masato Nakayama, Takanobu Nishiura, Yuki Denda, Norihide Kitaoka, Kazumasa Yamamoto, Takeshi Yamada, Satoru Tsuge, Chiyomi Miyajima, Masakiyo Fujimoto, Tetsuya Takiguchi, Satoshi Tamura, Tetsuji Ogawa, Shigeki Matsuda, Shingo Kuroiwa, Kazuya Takeda, Satoshi Nakamura, "CENSREC-4: Development of evaluation framework for distant-talking speech recognition under reverberant environments," Proc. INTERSPEECH 2008, pp.968--971, Brisbane, Australia, Sep. 2008.
  27. Satoru Kogure, Hiromitsu Nishizaki, Masatoshi Tsuchiya, Kazumasa Yamamoto, Shingo Togashi, Seiichi Nakagawa, "Speech recognition performance of CJLC: Corpus of Japanese Lecture Contents," Proc. INTERSPEECH 2008, pp.1554--1557, Brisbane, Australia, Sep. 2008.
  28. Yasuhisa Fujii, Kazumasa Yamamoto, Norihide Kitaoka, Seiichi Nakagawa, "Class lecture summarization taking into account consecutiveness of important sentences," Proc. INTERSPEECH 2008, pp.2438--2441, Brisbane, Australia, Sep. 2008.
  29. Takanobu Nishiura, Masato Nakayama, Yuki Denda, Norihide Kitaoka, Kazumasa Yamamoto, Takeshi Yamada, Satoru Tsuge, Chiyomi Miyajima, Masakiyo Fujimoto, Tetsuya Takiguchi, Satoshi Tamura, Shingo Kuroiwa, Kazuya Takeda and Satoshi Nakamura, "Evaluation framework for distant-talking speech recognition under reverberant environments: newest part of the CENSREC series -," Proc. the Sixth International Language Resources and Evaluation Conference (LREC'08), Marrakech, Morocco, May 2008.
  30. Alberto Yoshihiro Nakano, Longbiao Wang, Kazumasa Yamamoto, Seiichi Nakagawa, "Sound source localization by distributed microphone network," Proc. 2008 RISP International Workshop on Nonlinear Circuits and Signal Processing (NCSP 2008), pp.383--386, Gold Coast, Australia, Mar. 2008.
  31. Norihide Kitaoka, Kazumasa Yamamoto, Tomohiro Kusamizu, Seiichi Nakagawa, Takeshi Yamada, Satoru Tsuge, Chiyomi Miyajima, Takanobu Nishiura, Masato Nakayama, Yuki Denda, Masakiyo Fujimoto, Tetsuya Takiguchi, Satoshi Tamura, Shingo Kuroiwa, Kazuya Takeda, Satoshi Nakamura, "Development of VAD evaluation framework CENSREC-1-C and investigation of relationship between VAD and speech recognition performance," Proc. 2007 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2007), pp.607--612, Kyoto, Japan, Dec. 2007.
  32. Md. Babul Islam, Hiroshi Matsumoto, Kazumasa Yamamoto, "An improved Mel-Wiener filter for Mel-LPC based speech recognition," Proc. INTERSPEECH 2006 - ICSLP, pp.45--48, Pittsburgh, USA, Sep. 2006.
  33. Naotoshi Nakatani, Kazumasa Yamamoto, Hiroshi Matsumoto, "Mel-LSP parameterization for HMM-based speech synthesis," Proc. International Conference on Speech and Computer (SPECOM 2006), pp.261--264, St. Petersburg, Russia, Jun. 2006.
  34. Md. Babul Islam, Hiroshi Matsumoto, Kazumasa Yamamoto, "Evaluation of Mel-Wiener filter for Mel-LPC based speech recognition," Proc. International Conference on Speech and Computer (SPECOM 2005), pp.531--534, Patras, Greece, Oct. 2005.
  35. Md. Babul Islam, Hiroshi Matsumoto, Kazumasa Yamamoto, "Frequency warped Wiener filtering for Mel-LPC based speech recognition," Proc. International Workshop on Nonlinear Signal and Image Processing (NSIP 2005), pp.298--301, Sapporo, Japan, May 2005.
  36. Masakiyo Fujimoto, Satoshi Nakamura, Kazuya Takeda, Shingo Kuroiwa, Takeshi Yamada, Norihide Kitaoka, Kazumasa Yamamoto, Mitsunori Mizumachi, Takanobu Nishiura, Akira Sasou, Chiyomi Miyajima, and Toshiki Endo, "CENSREC-3: Data Collection for In-Car Speech Recognition and Its Common Evaluation Framework," International Workshop on Realworld Multimedia Corpora in Mobile Environment, pp.53--60, Tokyo, Japan, Mar. 2005.
  37. Satoshi Nakamura, Kazumasa Yamamoto, Kazuya Takeda,Shingo Kuroiwa, Norihide Kitaoka,Takeshi Yamada,Mitsunori Mizumachi, Takanobu Nishiura, Masakiyo Fujimoto, Akira Saso, Toshiki Endo, "AURORA-2J: Japanese speech data collection for performance evaluation of speech recognition in noise", Proc. International Conference on Speech and Language Technology/Oriental-COCOSDA (ICSLT2004/O-COCOSDA2004), New Delhi, India, Oct. 2004.
  38. Satoshi Nakamura, Kazumasa Yamamoto, Kazuya Takeda, Shingo Kuroiwa, Norihide Kitaoka, Takeshi Yamada, Mitsunori Mizumachi, Takanobu Nishiura, Masakiyo Fujimoto, Akira Saso and Toshiki Endo, "Data Collection and Evaluation of AURORA-2 Japanese Corpus", Proc. IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2003), pp.619--623, St. Thomas, U.S. Virgin Islands, Nov. 2003.
  39. Takeshi Yamada, Jiro Okada, Kazuya Takeda, Norihide Kitaoka, Masakiyo Fujimoto, Shingo Kuroiwa, Kazumasa Yamamoto, Takanobu Nishiura, Mitsunori Mizumachi, Satoshi Nakamura, "Integration of Noise Reduction Algorithms for AURORA2 Task," Proc. EUROSPEECH 2003, pp.1769--1772, Geneva, Switzerland, Sep. 2003.
  40. Hiroshi Matsumoto, Akihiko Shimizu, Kazumasa Yamamoto, "Evaluation of a generalized dynamic cepstrum in distant speech recognition," Proc. EUROSPEECH 2001, Vol.2, pp.881--884, Aalborg, Denmark, Sep. 2001.
  41. Seiichi Nakagawa, Kengo Hanai, Kazumasa Yamamoto, Nobuaki Minematsu, "Comparison of syllable-based HMMs and triphone-based HMMs in Japanese speech recognition," Proc. International Workshop on Automatic Speech Recognition and Understanding (ASRU '99), pp.197--200, Keystone, USA, Dec. 1999.

Abstract reviewed

  1. Hiroshi Matsumoto, Tasuku Takei, Kazumasa Yamamoto, "Reverberation modeling on powerspectral trajectory for distant speech recognition," Proc. Joint Workshop on Hands-Free Speech Communication and Microphone Arrays (HSCMA 2005), p.b-19, Piscataway, USA, Mar. 2005.
  2. Hiroshi Matsumoto, Takumasa Ichikawa, Kazumasa Yamamoto, "Improved forward masking on a generalized logarithmic scale for robust speech recognition," Proc. International Congress on Acoustics (ICA 2004), pp.IV-2831--IV-2834, Kyoto Japan, Apr. 2004.
  3. Kazumasa Yamamoto, Taro Ikeda, Hiroshi Matsumoto, Masanobu Nishitani, Yasunaga Miyazawa, "Syllable-connected models for Japanese speech recognition," Proc. International Congress on Acoustics (ICA 2004), pp.V-3053--V-3056, Kyoto, Japan, Apr. 2004.
  4. Joo-Gon Kim, Kazumasa Yamamoto, Hyun-Yeol Chung, Seiichi Nakagawa, "A study on Korean speech recognition using MEL-LPC analysis," Proc. International Conference on Speech Processing (ICSP 2001), pp.639--642, Taejon, Korea, Aug. 2001.
  5. Hiroshi Matsumoto, Yoshihiro Ito, Akihiro Shimizu, Kazumasa Yamamoto, "A generalized dynamic cepstrum for hands-free speech recognition," Proc. International Workshop on Hands-free Speech Communication (HSC 2001), pp.115--118, Kyoto, Japan, Apr. 2001.
  6. Kazumasa Yamamoto, Seiichi Nakagawa, Hiroshi Matsumoto, "Evaluation of PMC for segmental unit input HMM in various environments," Proc. International Workshop on Hands-free Speech Communication (HSC 2001), pp.183--186, Kyoto, Japan, Apr. 2001.
  7. Yoshihiro Ito, Hiroshi Matsumoto, Kazumasa Yamamoto, "Forward masking on a generalized logarithmic scale for robust speech recognition," Proc. International Conference on Spoken Language Processing (ICSLP 2000), Vol.III, pp.530--533, Beijing, China, Oct. 2000.
  8. Kazumasa Yamamoto, Seiichi Nakagawa, "Relationship among speaking style, inter-phoneme's distance and speech recognition performance," Proc. International Conference on Spoken Language Processing (ICSLP 2000), Vol.II, pp.859--862, Beijing, China, Oct. 2000.
  9. Kazumasa Yamamoto, Seiichi Nakagawa, "Difference in speech recognition performance caused by difference in front-end devices and its compensation," Proc. WESTPRAC-VII, Vol.1, pp.85--88, Kumamoto, Japan, Oct. 2000.
  10. Kazumasa Yamamoto, Seiichi Nakagawa, "HMM composition of segmental unit input HMM for noisy speech recognition," Proc. EUROSPEECH '99, pp.2865--2868, Budapest, Hungary, Sep. 1999.
  11. Kengo Hanai, Kazumasa Yamamoto, Nobuaki Minematsu, Seiichi Nakagawa, "Continuous speech recognition using segmental unit input HMMs with a mixture of probability density functions and context dependency," Proc. International Conference on Spoken Language Processing (ICSLP '98), pp.2935--2938, Sydney, Australia, Dec. 1998.
  12. Kazumasa Yamamoto, Seiichi Nakagawa, "Evaluation of segmental unit input HMM in noisy environments," Proc. International Conference on Speech Processing (ICSP '97), pp.643--648, Seoul, Korea, Aug. 1997.
  13. Seiichi Nakagawa, Kazumasa Yamamoto, "Evaluation of segmental unit input HMM," Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '96), pp.439--442, Atlanta, USA, May 1996.
  14. Kazumasa Yamamoto, Seiichi Nakagawa, "Comparative evaluation of segmental unit input HMM and conditional density HMM," Proc. EUROSPEECH '95, pp.1615--1618, Madrid, Spain, Sep. 1995.

研究会・ワークショップ発表 / Domestic Conference or Workshop Papers

  1. 藤本雅清, 中村哲, 武田一哉, 黒岩眞吾, 山田武志, 北岡教英, 山本一公, 水町光徳, 西浦敬信, 佐宗晃, 宮島千代美, 遠藤俊樹, "実走行車内 単語音声データベースCENSREC-3と共通評価環境の構築," 情報処理学会研究報告, 2005-SLP-55-13, pp.41--46, 2005.
  2. 中村哲, 武田一哉, 黒岩眞吾, 北岡教英, 山田武志, 山本一公, 西浦敬 信, 佐宗晃, 水町光徳, 宮島千代美, 藤本雅清, 遠藤俊樹, "[招待講演]実環境下音声認識の評価の標準化とその動向," 第6回音声言語シンポジウム, 情報処理学会研究報告, 2004-SLP-54-24, pp.139--144, 2004.
  3. 藤本雅清, 中村哲, 武田一哉, 黒岩眞吾, 山田武志, 北岡教英, 山本一公, 水町光徳, 西浦敬信, 佐宗晃, 宮島千代美, 遠藤俊樹, "CENSREC-3: 実走行車内単語音声データベースと評価環境の構築," 第6回音声言語シンポジウム, 情報処理学会研究報告, 2004-SLP-54-40, pp.235--240, 2004.
  4. 山田武志, 武田一哉, 北岡教英, 藤本雅清, 黒岩眞吾, 山本一公, 西浦敬信, 佐宗晃, 水町光徳, 遠藤俊樹, 中村哲, "AURORA-2Jを用いたETSI STQ Aurora WI008 Advanced DSR Frontendの評価," 第5回音声言語シンポジウム, 情報処理学会研究報告, 2003-SLP-49-18, pp.103--108, 2003.
  5. 池田太郎, 山本一公, 松本 弘, 西谷正信, 宮澤康永, "音節連鎖モデルによる大語彙連続音声認識," 第5回音声言語シンポジウム, 情報処理学会研究報告, 2003-SLP-49-26, pp.151--156, 2003.
  6. 山田武志, 岡田治郎, 武田一哉, 北岡教英, 藤本雅清, 黒岩眞吾, 山本一公, 西浦敬信, 水町光徳, 中村哲, "雑音下音声認識のための複数の前処理手法の統合とそのAURORA-2Jによる評価," 情報処理学会研究報告, 2003-SLP-47-18, pp.95--100, 2003.
  7. 山本一公, 中村哲, 武田一哉, 黒岩眞吾, 北岡教英, 山田武志, 水町光徳, 藤本雅清, 西浦敬信, "AURORA-2J/AURORA-3Jデータベースとその評価ベースライン," 情報処理学会研究報告, 2003-SLP-47-19, pp.101--106, 2003.
  8. 中村哲, 武田一哉, 黒岩眞吾, 山田武志, 北岡教英, 山本一公, 西浦敬信, 藤本雅清, 水町光徳, "SLP雑音下音声認識評価のためのWG: 評価データ収集について," 情報処理学会研究報告, 2002-SLP-45-9, pp.51--55, 2003.
  9. 中村哲, 西浦敬信, 武田一哉, 黒岩眞吾, 山田武志, 北岡教英, 山本一公, 藤本雅清, 水町光徳, "ETSI AURORAプロジェクトの動向と雑音下音声認識評価ワーキンググループの活動報告," 人工知能学会研究報告, 第16回AIチャレンジ研究会, pp.57-62, 2002.
  10. 中村哲, 武田一哉, 黒岩眞吾, 山田武志, 北岡教英, 山本一公, 西浦敬信, 藤本雅清, 水町光徳, "SLP雑音下音声認識評価ワーキンググループ活動報告," 情報処理学会研究報告, 2002-SLP-42-4, pp.17--22, 2002.
  11. 今井豊綱, 山本一公, 松本弘, "SVD法を用いた重回帰話者適応," 電子情報通信学会技術研究報告, SP2001-50, 2001.
  12. 山本一公, 森一将, 中川聖一, "話し言葉音声と読み上げ音声の連続音節認識による比較," 話し言葉の科学と工学ワークショップ講演予稿集, pp.117--124, 2001.
  13. 森一将, 山本一公, 中川聖一, "発話間のVQ歪を用いたオンライン話者交代識別と話者クラスタリング," 電子情報通信学会技術報告, SP2000-18, pp.17--24, 2000.

2000年以前については後に補完予定.


学会発表 / Domestic Society Meeting Papers

招待講演

  1. 山本一公, "SLP雑音下音声認識評価WGによる共通コーパスと評価の枠組み," 情報科学技術フォーラムFIT2003, イベント企画「雑音下音声認識に関する共通コーパスと評価」講演, 2003.

全国大会発表

  1. 草水智浩, 山本一公, 北岡教英, 中川聖一, "音声区間検出が音声認識性能に与える影響についての検討," 日本音響学会2008年春季研究発表会講演論文集, 1-Q-8, pp.169--172, 2008.
  2. 西浦敬信, 中山雅人, 傳田遊亀, 北岡教英, 山本一公, 山田武志, 藤本雅清, 柘植覚, 宮島千代美, 滝口哲也, 田村哲嗣, 小川哲司, 松田繁樹, 黒岩眞吾, 武田一哉, 中村哲, "残響下音声認識評価基盤(CENSREC-4)の構築," 日本音響学会2008年春季研究発表会講演論文集, 1-Q-10, pp.175--178, 2008.
  3. Alberto Yoshihiro Nakano, Longbiao Wang, Kazumasa Yamamoto, Seiichi Nakagawa, "Acoustic source localization based on distributed microphone arrays in a living room," The 2008 Spring Meeting of The Acoustical Society of Japan, 2-6-18, pp.703--706, 2008.
  4. 小暮悟, 西崎博光, 土屋雅稔, 山本一公, 中川聖一, "日本語講義コンテンツコーパスの構築と分析," 日本音響学会2007年秋季研究発表会講演論文集, 1-3-4, pp.13--16, 2007.
  5. 草水智浩, 山本一公, 北岡教英, 中川聖一, "VADが音声認識性能に与える影響," FIT2007(第6回情報科学技術フォーラム), E-055, pp.269--270, 2007.
  6. Md. Babul Islam, Hiroshi Matsumoto, Kazumasa Yamamoto, "Improvement of Mel-Wiener filter for noisy speech recognition," The 2005 Autumn Meeting of The Acoustical Society of Japan, 3-7-16, pp.133--134, 2005.
  7. 竹居翼, 松本弘, 山本一公, "短時間スペクトル系列残響モデルの付加雑音下での推定と音声認識による評価," 日本音響学会2005年秋季研究発表会講演論文集, 3-7-23, pp.147--148, 2005.
  8. Md. Babul Islam, Hiroshi Matsumoto, Kazumasa Yamamoto, "Wiener filter on warped frequency scale for Mel-LPC based speech recognition," The 2005 Spring Meeting of The Acoustical Society of Japan, 1-5-10, pp.19--20, 2005.
  9. 竹居翼, 松本弘, 山本一公, "パワートラジェクトリ上での残響のモデル化と遠隔音声認識への適用," 日本音響学会2005年春季研究発表会講演論文集, 2-Q-15, pp.127--138, 2005.
  10. 藤本雅清, 中村哲, 武田一哉, 黒岩眞吾, 山田武志, 北岡教英, 山本一公, 水町光徳, 西浦敬信, 佐宗晃, 宮島千代美, 遠藤俊樹, "実走行車内音声認識の評価データベースCENSREC-3とその共通評価ベースライン," 日本音響学会2005年春季研究発表会講演論文集, 2-Q-23, pp.143--144, 2005.
  11. 小坂淳一, 山本一公, 松本弘, "音節誤り頻度に基づくトライ音節モデルの検討," 日本音響学会2004年秋季研究発表会講演論文集, 2-1-24, pp.83--84, 2004.
  12. Md. Babul Islam, Hiroshi Matsumoto, Kazumasa Yamamoto, "Evaluation of Mel-LPC based auditory-like features on the Aurora 2 database," The 2004 Autumn Meeting of The Acoustical Society of Japan, 3-1-3, pp.115--116, 2004.
  13. 山田武志, 武田一哉, 北岡教英, 藤本雅清, 黒岩眞吾, 山本一公, 西浦敬信, 宮島千代美, 佐宗晃, 水町光徳, 遠藤俊樹, 中村哲, "AURORA-2Jと種々の評価指標を用いたETSI STQ Aurora WI008 Advanced DSR Frontendの評価," 日本音響学会2004年春季研究発表会講演論文集, 2-8-12, pp.83--84, 2004.
  14. 市川卓正, 山本一公, 松本 弘, "一般化動的ケプストラムを用いたフロントエンドのAURORA-2Jによる評価," 日本音響学会2004年春季研究発表会講演論文集, 2-8-15, pp.89--90, 2004.
  15. 伊藤正紀, 松本 弘, 山本一公, "VOCODER型分析合成におけるスペクトル包絡への位相付加に関する検討", 日本音響学会2004年春季研究発表会講演論文集, 2-P-14, pp.357--358, 2004.
  16. 池田太郎, 山本一公, 松本 弘, 西谷正信, 宮澤康永, "音節連鎖モデルの大語彙連続音声認識による評価," 日本音響学会2003年秋季研究発表会講演論文集, 2-6-6, pp.71--72, 2003.
  17. 渡邊友裕, 西崎博光, 山本一公, 北岡教英, 宇津呂武仁, 中川聖一, "複数の認識システムの出力の統合法による講演音声の認識," 日本音響学会2003年秋季研究発表会講演論文集, 3-6-9, pp.119--120, 2003.
  18. 山本一公, 中村 哲, 武田一哉, 黒岩眞吾, 北岡教英, 山田武志, 水町光徳, 西浦敬信, 藤本雅清, 佐宗晃, 遠藤俊樹, "雑音下音声認識共通評価データベースAURORA-2Jとその評価ベースライン," 日本音響学会2003年秋季研究発表会講演論文集, 3-Q-11, pp.147--148, 2003.
  19. 山田武志, 岡田治郎, 武田一哉, 北岡教英, 藤本雅清, 黒岩眞吾, 山本一公, 西浦敬信, 佐宗晃, 水町光徳, 遠藤俊樹, 中村哲, "複数の雑音抑圧手法の統合によるロバスト音声認識とそのAURORA-2Jによる評価," 日本音響学会2003年秋季研究発表会講演論文集, 3-Q-12, pp.149--150, 2003.
  20. 池田太郎, 山本一公, 松本弘, 西谷正信, 宮澤康永, "音声認識における音節連鎖モデルの検討," 日本音響学会2003年春季研究発表会講演論文集, 1-4-3, pp.5--6, 2003.
  21. 市川卓正, 山本一公, 松本弘, "一般化動的ケプストラムを用いたフロントエンドの耐雑音性の改良," 日本音響学会2003年春季研究発表会講演論文集, 1-4-24, pp.53--54, 2003.
  22. 山本一公, 池田太郎, 松本弘, 西谷正信, 宮澤康永, "コンパクトで高精度な音節モデルの検討," 日本音響学会2002年秋季研究発表会講演論文集, 1-9-22, pp.43--44, 2002.
  23. 平林裕人, 山本一公, 松本弘, "メルLPC分析に基づく音声認識への聴覚特性の導入," 日本音響学会2002年春季研究発表会講演論文集, Vol.I, pp.5--6, 2002.
  24. 清水明彦, 山本一公, 松本弘, "残響付加音声に対する動的ケプストラムの最適化," 日本音響学会2001年秋季研究発表会講演論文集, Vol.I, pp.43--44, 2001.
  25. 今井豊綱, 山本一公, 松本弘, "SVD法による重回帰話者適応化法の改良," 日本音響学会2001年秋季研究発表会講演論文集, Vol.I, pp.125-126, 2001.
  26. 伊藤祥宏, 清水明彦, 山本一公, 松本弘, "動的一般化ケプストラムによるハンズフリー音声認識の検討," 日本音響学会2001年春季研究発表会講演論文集, Vol.I, pp.67--68, 2001.
  27. 諸戸正憲, 山本一公, 松本弘, "コンテキスト依存音節単位HMMの評価," 日本音響学会2001年春季研究発表会講演論文集, Vol.I, pp.95--96, 2001.
  28. 今井豊綱, 山本一公, 松本弘, "SVD法に基づく重回帰話者適応の大語彙連続音声認識による評価," 日本音響学会2001年春季研究発表会講演論文集, Vol.I, pp.121--122, 2001.
  29. 今井豊綱, 山本一公, 松本弘, "2次項を含む重回帰話者適応の検討," 日本音響学会2000年秋季研究発表会講演論文集, Vol.I, pp.11--12, 2000.
  30. 伊藤祥宏, 山本一公, 松本弘, "一般化対数関数目盛上のフォワードマスキングによる耐環境性の改善," 日本音響学会2000年秋季研究発表会講演論文集, Vol.I, pp.69--70, 2000.

2000年以前については後に補完予定.

支部大会発表

  1. 傍島裕迪, 山本一公, 松本弘, "講演音声認識における認識誤りの分析," 平成17年度電子情報通信学会信越支部大会講演論文集, pp.173--174, 2005.
  2. 和田真祐, 山本一公, 松本弘, "多段線形近似を用いた雑音へのHMM適応," 平成17年度電子情報通信学会信越支部大会講演論文集, pp.177--178, 2005.
  3. 中谷尚俊, 山本一公, 松本弘, "音声合成を目的としたLSP領域とケプストラム領域のセントロイドスペクトルの比較," 平成17年度電子情報通信学会信越支部大会講演論文集, pp.179--180, 2005.
  4. 今井宏, 竹井翼, 山本一公, 松本弘, "残響音声認識における変調フィルタの評価," 平成17年度電子情報通信学会信越支部大会講演論文集, pp.181--182, 2005.
  5. 桃井潤一, 山本一公, 松本弘, "発話速度別モデルの構築法に関する検討," 平成17年度電子情報通信学会信越支部大会講演論文集, pp.231--232, 2005.
  6. 娜日蘇, 山本一公, 松本弘, "音節を単位としたHMM音声合成の検討," 平成16年度電子情報通信学会信越支部大会講演論文集, pp.229--230, 2004.
  7. 伊藤正紀, 山本一公, 松本弘, "PSOLA法における単位波形位相が合成音声品質に及ぼす影響," 平成16年度電子情報通信学会信越支部大会講演論文集, pp.231--232, 2004.
  8. 今井宏, 山本一公, 松本弘, "音声認識におけるMFCC分析特性の最適化," 平成16年度電子情報通信学会信越支部大会講演論文集, pp.239--240, 2004.
  9. 桃井潤一, 山本一公, 松本弘, "音声認識における発話速度変動に頑健な音響モデルの検討," 平成16年度電子情報通信学会信越支部大会講演論文集, pp.411--412, 2004.
  10. 竹居翼, 山本一公, 松本弘, "スペクトル系列に関する残響のモデル化と音声認識に関する評価," 平成16年度電子情報通信学会信越支部大会講演論文集, pp.421--422, 2004.
  11. 伊藤正紀, 松本 弘, 山本一公, "PSE分析・合成系における位相情報付加による合成音声品質改善の検討," 平成15年度電子情報通信学会信越支部大会講演論文集, pp.191--192, 2003.
  12. 小坂淳一, 山本一公, 松本 弘, "音節モデルによる講演音声認識と謝り傾向の調査," 平成15年度電子情報通信学会信越支部大会講演論文集, pp.193--194, 2003.
  13. 染谷貴史, 松本弘, 山本一公, "MEL-LPC-PSOLAに基づく音声分析合成系の改良," 平成14年度電子情報通信学会信越支部大会講演論文集, A7, pp.37--38, 2002.
  14. 清水明彦, 松本弘, 山本一公, "残響の短時間スペクトルの影響と補償に関する検討," 平成14年度電子情報通信学会信越支部大会講演論文集, A8, pp.39--40, 2002.
  15. 市川卓正, 松本弘, 山本一公, "Aurora2による一般化動的ケプストラムの評価," 平成14年度電子情報通信学会信越支部大会講演論文集, A9, pp.41--42, 2002.
  16. 池田太郎, 山本一公, 松本弘, "音節と音節連鎖モデルによる連続音声認識," 平成14年度電子情報通信学会信越支部大会講演論文集, A10, pp.43--44, 2002.
  17. 平林裕人, 山本一公, 松本弘, "メルLPC分析に聴覚特性を導入した音声認識フロントエンド," 平成14年度電子情報通信学会信越支部大会講演論文集, A11, pp.45--46, 2002.
  18. 染谷貴史, 山本一公, 松本弘, "Mel-LPCを用いたPSOLA音声変換合成システム," 平成13年度電子情報通信学会信越支部大会講演論文集, pp.53--54, 2001.
  19. 平林裕人, 山本一公, 松本弘, "時間領域PLPによる音声認識の検討," 平成13年度電子情報通信学会信越支部大会講演論文集, pp.55--56, 2001.
  20. 安藤章悟, 山本一公, 松本弘, "前後の音韻環境を考慮した音声認識のための音節モデルの考察," 平成13年度電子情報通信学会信越支部大会講演論文集, pp.56--57, 2001.