Non-negative matrix factorization based noise reduction for noise robust automatic speech recognition
- Abstract
- In this paper, we propose a noise reduction method based on non-negative matrix factorization (NMF) for noise-robust automatic speech recognition (ASR). Most noise reduction methods applied to ASR front-ends have been developed for suppressing background noise that is assumed to be stationary rather than non-stationary. Instead, the proposed method attenuates non-target noise by a hybrid approach that combines a Wiener filtering and an NMF technique. This is motivated by the fact that Wiener filtering and NMF are suitable for reduction of stationary and non-stationary noise, respectively. It is shown from ASR experiments that an ASR system employing the proposed approach improves the average word error rate by 11.9%, 22.4%, and 5.2%, compared to systems employing the two-stage mel-warped Wiener filter, the minimum mean square error log-spectral amplitude estimator, and NMF with a Wiener post-filter, respectively. © 2012 Springer-Verlag.
- Author(s)
- Kim, Seon Man; Park, Ji Hun; Kim, Hong Kook; Lee, Sung Joo; Lee, Yun Keun
- Issued Date
- 2012-03
- Type
- Conference Paper
- DOI
- 10.1007/978-3-642-28551-6_42
- URI
- https://scholar.gist.ac.kr/handle/local/23897
- 공개 및 라이선스
-
- 파일 목록
-
Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.