OAK

DNN-based Decoded Audio Enhancement using Vector-Quantized Side Information

Metadata Downloads
Author(s)
Young Ju Cheon
Type
Thesis
Degree
Master
Department
대학원 전기전자컴퓨터공학부
Advisor
Shin, Jong Won
Abstract
Audio codecs have been studied for several decades to achieve better perceptual quality with lower bitrate. The perceptual audio codecs may induce audible coding artifacts when encoding audio at low bitrates. To solve these problems, codec post-processing methods have been studied. Recently, deep neural networks (DNN) based approaches have been studied to enhance the quality of the decoded signals. If the number of patterns of residual, which is the difference between the original signal and the decoded signal is limited, the decoded signal can be improved by learning the patterns as a codebook. In this paper, we propose the neural network-based decoded audio enhancement using side information. Our experiments on TIMIT and VCTK dataset demonstrate that the proposed method is more efficient than DNN-based enhancement without side information at a higher bitrate.
URI
https://scholar.gist.ac.kr/handle/local/33179
Fulltext
http://gist.dcollection.net/common/orgView/200000907569
공개 및 라이선스
  • 공개 구분공개
파일 목록
  • 관련 파일이 존재하지 않습니다.

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.