OAK

Dual Microphone Voice Activity Detection Exploiting Interchannel Time and Level Differences

Metadata Downloads
Abstract
The two most important spatial cues in human auditory system may be the interaural time difference and the interaural level difference. There have been many attempts to utilize the time difference of arrival (TDoA) and level difference between two microphone signals for voice activity detection (VAD). In this letter, we propose a dual microphone VAD algorithm based on a support vector machine for which the input vector consists of both TDoA-based and level difference-based features. Several candidates for the feature combination have been compared using various TDoA-related and level difference-related features. Experimental results showed that the proposed VAD algorithm outperformed a standardized single microphone VAD, VADs based on the TDoA or level difference, and logical combination of them in various noise environments. © 2016 IEEE.
Author(s)
Park, JaehoonJin, Yu GwangHwang, SoojoongShin, Jong Won
Issued Date
2016-10
Type
Article
DOI
10.1109/LSP.2016.2597360
URI
https://scholar.gist.ac.kr/handle/local/14055
Publisher
IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
Citation
IEEE Signal Processing Letters, v.23, no.10, pp.1335 - 1339
ISSN
1070-9908
Appears in Collections:
Department of Electrical Engineering and Computer Science > 1. Journal Articles
공개 및 라이선스
  • 공개 구분공개
파일 목록
  • 관련 파일이 존재하지 않습니다.

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.