OAK

Dual Microphone Speech Enhancement Based on Statistical Modeling of Interchannel Phase Difference

Metadata Downloads
Abstract
The interchannel phase difference (IPD) may be one of the most widely-used spatial cues in multichannel speech processing, and has been used in beamformers and post filters for speech enhancement. The coherence, which is also used as a feature for speech enhancement, can provide information on the reliability of the IPD for the estimation of the speech presence probability (SPP). In this paper, we propose dual microphone speech enhancement adopting a posteriori SPP estimation based on statistical modeling of the IPD. The marginal distribution of the IPD is derived from the distribution of the relative transfer function which is parameterized with the IPD and coherence, with a single assumption that the observed discrete Fourier transform (DFT) coefficients in each frequency are distributed according to a complex bivariate Gaussian distribution. Given the direction of arrival of the desired signal, the a posteriori SPP is obtained using the IPD distributions with and without the information on the location of the interfering source, and is applied to speech enhancement. Experimental results for various types and locations of noise, signal-to-noise ratios, reverberation times, and locations of the target source showed that the proposed method outperformed previously proposed approaches utilizing IPD information.
Author(s)
Hwang, SoojoongKim, MinseungShin, Jong Won
Issued Date
2022-08
Type
Article
DOI
10.1109/taslp.2022.3202121
URI
https://scholar.gist.ac.kr/handle/local/10668
Publisher
IEEE Advancing Technology for Humanity
Citation
IEEE/ACM Transactions on Speech and Language Processing, v.30, pp.2865 - 2874
ISSN
2329-9290
Appears in Collections:
Department of Electrical Engineering and Computer Science > 1. Journal Articles
공개 및 라이선스
  • 공개 구분공개
파일 목록
  • 관련 파일이 존재하지 않습니다.

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.