OAK

Deep Learning Based Personalized HRTF Estimation Using Anthropometric parameters and Ear Images

Metadata Downloads
Author(s)
Geon Woo Lee
Type
Thesis
Degree
Master
Department
대학원 전기전자컴퓨터공학부
Advisor
Kim, Hong Kook
Abstract
This paper proposes a personalized head-related transfer function (HRTF) estimation method based on deep neural networks by using anthropometric measurements and ear images. A feedforward deep neural network (DNN) using the anthropometric measurements regarding the head and torso and a convolutional neural network (CNN) using the ear images are constructed. After that, the outputs of these two networks are merged into another DNN for estimation of the personalized HRTF. To evaluate the performance of the proposed method, objective and subjective evaluations are conducted. For the objective evaluation, the root mean square error (RMSE) and the log spectral distance (LSD) between the reference HRTF and the estimated one are measured. Consequently, the proposed method provides the lower RMSE and LSD than the DNN-based method using anthropometric data without pinna measurements, respectively. Next, a sound localization test is performed for the subjective evaluation. As a result, it is shown that the proposed method can localize sound sources with higher accuracy than the average HRTF method and DNN-based method, respectively. In addition, the reductions of the front/back confusion rate are achieved by the proposed method, compared to the average HRTF method and DNN-based method, respectively.
URI
https://scholar.gist.ac.kr/handle/local/32523
Fulltext
http://gist.dcollection.net/common/orgView/200000910563
공개 및 라이선스
  • 공개 구분공개
파일 목록
  • 관련 파일이 존재하지 않습니다.

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.