OAK

깊은 신경망 특징 기반 화자 검증 시스템의 성능 비교

Metadata Downloads
Alternative Title
Performance Comparison of Deep Feature Based Speaker Verification Systems
Abstract
In this paper, several experiments are performed according to deep neural network (DNN) based features for the performance comparison of speaker verification (SV) systems. To this end, input features for a DNN, such as mel-frequency cepstral coefficient (MFCC), linear-frequency cepstral coefficient (LFCC), and perceptual linear prediction (PLP), are first compared in a view of the SV performance. After that, the effect of a DNN training method and a structure of hidden layers of DNNs on the SV performance is investigated depending on the type of features. The performance of an SV system is then evaluated on the basis of I-vector or probabilistic linear discriminant analysis (PLDA) scoring method. It is shown from SV experiments that a tandem feature of DNN bottleneck feature and MFCC feature gives the best performance when DNNs are configured using a rectangular type of hidden layers and trained with a supervised training method.
Author(s)
김대현성우경김홍국
Issued Date
2015-12
Type
Article
DOI
10.13064/KSSS.2015.7.4.009
URI
https://scholar.gist.ac.kr/handle/local/14464
Publisher
한국음성학회
Citation
말소리와 음성과학, v.7, no.4, pp.9 - 16
ISSN
2005-8063
Appears in Collections:
Department of Electrical Engineering and Computer Science > 1. Journal Articles
공개 및 라이선스
  • 공개 구분공개
파일 목록
  • 관련 파일이 존재하지 않습니다.

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.