Why speech recognizers make errors? A robustness view
- Author(s)
- Kim, Hong Kook; Mazin Rahim
- Type
- Conference Paper
- Citation
- 8th International Conference on Spoken Language Processing (ICSLP 2004), pp.1645 - 1648
- Issued Date
- 2004-10-04
- Abstract
- The performance of large vocabulary speech recognizers often varies depending on the input speech and the quality of the trained models. The particular attributes that cause recognition errors are a research area that has not been well studied. This paper addresses this issue from a robustness perspective using a large amount of field data collected from natural language dialog services. In particular, we present a method for tracking time-varying or nonstationary extraneous events, such as music, background noise, etc. We show that this measure is a better predictor of recognition errors than a standard measure of stationary signal-to-noise ratio (SNR). Combining the two measures provides a data selection algorithm for detecting problematic speech.
- Publisher
- International Speech Communication Association
- Conference Place
- KO
Jeju Island, Korea
- URI
- https://scholar.gist.ac.kr/handle/local/28483
- 공개 및 라이선스
-
- 파일 목록
-
Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.