OAK

깊은 신경망 특징 기반 화자 검증 시스템의 성능 비교

Metadata Downloads
Author(s)
김대현성우경김홍국
Type
Article
Citation
말소리와 음성과학, v.7, no.4, pp.9 - 16
Issued Date
2015-12
Abstract
In this paper, several experiments are performed according to deep neural network (DNN) based features for the performance comparison of speaker verification (SV) systems. To this end, input features for a DNN, such as mel-frequency cepstral coefficient (MFCC), linear-frequency cepstral coefficient (LFCC), and perceptual linear prediction (PLP), are first compared in a view of the SV performance. After that, the effect of a DNN training method and a structure of hidden layers of DNNs on the SV performance is investigated depending on the type of features. The performance of an SV system is then evaluated on the basis of I-vector or probabilistic linear discriminant analysis (PLDA) scoring method. It is shown from SV experiments that a tandem feature of DNN bottleneck feature and MFCC feature gives the best performance when DNNs are configured using a rectangular type of hidden layers and trained with a supervised training method.
Publisher
한국음성학회
ISSN
2005-8063
DOI
10.13064/KSSS.2015.7.4.009
URI
https://scholar.gist.ac.kr/handle/local/14464
공개 및 라이선스
  • 공개 구분공개
파일 목록
  • 관련 파일이 존재하지 않습니다.

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.