OAK

Light-Weight Causal Speech Enhancement with Bone-Conduction

Metadata Downloads
Author(s)
이상윤
Type
Thesis
Degree
Master
Department
대학원 AI대학원
Advisor
Shin, Jong Won
Abstract
Speech enhancement aims to improve the quality of speech degraded by various types of noise, particularly under challenging conditions such as extremely low signal- to-noise ratio (SNR). Traditional methods predominantly rely on speech data captured by air-conduction (AC), which are highly susceptible to noise. This makes speech en- hancement at low SNRs a challenge. In contrast, bone-conduction (BC) is more robust to noise but provide information constrained to a limited frequency bandwidth. In this paper, we propose a novel fusion module that effectively integrates information from both air-conduction and bone-conduction. Additionally, we introduce a light-weight, causal network designed for low computational complexity, making it suitable for de- ployment on resource-constrained devices. Experimental evaluations demonstrate that the proposed model significantly outperforms the baseline, achieving superior speech quality while reducing model size without an increase in computational complexity.
URI
https://scholar.gist.ac.kr/handle/local/19455
Fulltext
http://gist.dcollection.net/common/orgView/200000859955
Alternative Author(s)
이상윤(Sangyun Lee)
Appears in Collections:
Department of AI Convergence > 3. Theses(Master)
공개 및 라이선스
  • 공개 구분공개
파일 목록
  • 관련 파일이 존재하지 않습니다.

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.