OAK

Cepstrum-domain acoustic feature compensation based on decomposition of speech and noise for ASR in noisy environments

Metadata Downloads
Abstract
This paper presents a set of acoustic feature pre-processing techniques that are applied to improving automatic speech recognition (ASR) performance on noisy speech recognition tasks. The principal contribution of this paper is an approach for cepstrum-domain feature compensation in ASR which is motivated by techniques for decomposing speech and noise that were originally developed for noisy speech enhancement. This approach is applied in combination with other feature compensation algorithms to compensating ASR features obtained from a mel-filterbank cepstrum coefficient front-end. Performance comparisons are made with respect to the application of the minimum mean squared error log spectral amplitude (MMSE-LSA) estimator based speech enhancement algorithm prior to feature analysis. An experimental study is presented where the feature compensation approaches described in the paper are found to greatly reduce ASR word error rate compared to uncompensated features under environmental and channel mismatched conditions.
Author(s)
Kim, Hong KookRose, RC
Issued Date
2003-09
Type
Article
DOI
10.1109/TSA.2003.815515
URI
https://scholar.gist.ac.kr/handle/local/18334
Publisher
Institute of Electrical and Electronics Engineers
Citation
IEEE Transactions on Speech and Audio Processing, v.11, no.5, pp.435 - 446
ISSN
1063-6676
Appears in Collections:
Department of Electrical Engineering and Computer Science > 1. Journal Articles
공개 및 라이선스
  • 공개 구분공개
파일 목록
  • 관련 파일이 존재하지 않습니다.

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.