OAK

Soft-masking based denoising and dereverberation for binaural speech separation in reverberant environments

Metadata Downloads
Abstract
In this paper, we propose a soft-masking based denoising and dereverberation method for binaural speech separation in order to improve the performance of speech recognition under reverberant conditions. For each time-frequency bin, the interaural time difference (ITD) is first computed, and then the signal-to-noise ratio (SNR) is estimated as the ratio of the powers of the target speech and noise signals from the ITD. Next, a denoising mask is estimated from the estimated SNR. Subsequently, a dereverberation mask is also obtained according to an estimate of the direct-to-reverberant energy ratio (DRR). In particular, to estimate the DRR of the current frame, the reverberant power is computed by summing the exponentially down-weighted powers of previous frames. It is demonstrated here that a binaural speech separation system with the proposed denoising and dereverberation masks outperforms a system with a conventional spatial and temporal mask (STM) in reverberant and noisy environments, in terms of speech recognition performance. © 2013 ICIC International.
Author(s)
Park, J.H.Kim, Hong Kook
Issued Date
2013-03
Type
Article
URI
https://scholar.gist.ac.kr/handle/local/15631
Publisher
ICIC Express Letters Office
Citation
ICIC Express Letters, v.7, no.3 A, pp.681 - 686
ISSN
1881-803X
Appears in Collections:
Department of Electrical Engineering and Computer Science > 1. Journal Articles
공개 및 라이선스
  • 공개 구분공개
파일 목록
  • 관련 파일이 존재하지 않습니다.

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.