OAK

Individual Sub-band Estimation Approach to Bandwidth Extension and Enhancement of Coded Speech

Metadata Downloads
Author(s)
Youngwon Choi
Type
Thesis
Degree
Master
Department
대학원 전기전자컴퓨터공학부
Advisor
Shin, Jong Won
Abstract
The streaming Sound EnhAncement Network (SEANet) has demonstrated impressive performance for speech bandwidth extension (BWE) with low latency and computational complexity. Although the streaming SEANet was designed for voice communication systems, it was not tested with decoded signals that included coding artifacts. Our preliminary experiment showed that the output of the streaming SEANet for the decoded speech had room for improvement even if it was trained with decoded speeches, possibly because it should perform the BWE and the coded speech enhancement (CSE) at once. In this work, we propose to utilize two streaming SEANets in parallel, which are dedicated to the narrowband CSE and the generation of the upper band speech signal, respectively. Experimental results showed that the proposed model outperformed a bigger streaming SEANet trained to carry out both tasks in terms of the PESQ scores and the MUSHRA test.
URI
https://scholar.gist.ac.kr/handle/local/19400
Fulltext
http://gist.dcollection.net/common/orgView/200000883684
공개 및 라이선스
  • 공개 구분공개
파일 목록
  • 관련 파일이 존재하지 않습니다.

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.