Light-Weight Causal Speech Enhancement with Bone-Conduction
- Author(s)
- 이상윤
- Type
- Thesis
- Degree
- Master
- Department
- 대학원 AI대학원
- Advisor
- Shin, Jong Won
- Abstract
- Speech enhancement aims to improve the quality of speech degraded by various types of noise, particularly under challenging conditions such as extremely low signal- to-noise ratio (SNR). Traditional methods predominantly rely on speech data captured by air-conduction (AC), which are highly susceptible to noise. This makes speech en- hancement at low SNRs a challenge. In contrast, bone-conduction (BC) is more robust to noise but provide information constrained to a limited frequency bandwidth. In this paper, we propose a novel fusion module that effectively integrates information from both air-conduction and bone-conduction. Additionally, we introduce a light-weight, causal network designed for low computational complexity, making it suitable for de- ployment on resource-constrained devices. Experimental evaluations demonstrate that the proposed model significantly outperforms the baseline, achieving superior speech quality while reducing model size without an increase in computational complexity.
- URI
- https://scholar.gist.ac.kr/handle/local/19455
- Fulltext
- http://gist.dcollection.net/common/orgView/200000859955
- 공개 및 라이선스
-
- 파일 목록
-
Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.