OAK

Why speech recognizers make errors? A robustness view

Metadata Downloads
Author(s)
Kim, Hong KookMazin Rahim
Type
Conference Paper
Citation
8th International Conference on Spoken Language Processing (ICSLP 2004), pp.1645 - 1648
Issued Date
2004-10-04
Abstract
The performance of large vocabulary speech recognizers often varies depending on the input speech and the quality of the trained models. The particular attributes that cause recognition errors are a research area that has not been well studied. This paper addresses this issue from a robustness perspective using a large amount of field data collected from natural language dialog services. In particular, we present a method for tracking time-varying or nonstationary extraneous events, such as music, background noise, etc. We show that this measure is a better predictor of recognition errors than a standard measure of stationary signal-to-noise ratio (SNR). Combining the two measures provides a data selection algorithm for detecting problematic speech.
Publisher
International Speech Communication Association
Conference Place
KO
Jeju Island, Korea
URI
https://scholar.gist.ac.kr/handle/local/28483
공개 및 라이선스
  • 공개 구분공개
파일 목록
  • 관련 파일이 존재하지 않습니다.

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.