깊은 신경망 특징 기반 화자 검증 시스템의 성능 비교
- Alternative Title
- Performance Comparison of Deep Feature Based Speaker Verification Systems
- Abstract
- In this paper, several experiments are performed according to deep neural network (DNN) based features for the performance comparison of speaker verification (SV) systems. To this end, input features for a DNN, such as mel-frequency cepstral coefficient (MFCC), linear-frequency cepstral coefficient (LFCC), and perceptual linear prediction (PLP), are first compared in a view of the SV performance. After that, the effect of a DNN training method and a structure of hidden layers of DNNs on the SV performance is investigated depending on the type of features. The performance of an SV system is then evaluated on the basis of I-vector or probabilistic linear discriminant analysis (PLDA) scoring method. It is shown from SV experiments that a tandem feature of DNN bottleneck feature and MFCC feature gives the best performance when DNNs are configured using a rectangular type of hidden layers and trained with a supervised training method.
- Author(s)
- 김대현; 성우경; 김홍국
- Issued Date
- 2015-12
- Type
- Article
- DOI
- 10.13064/KSSS.2015.7.4.009
- URI
- https://scholar.gist.ac.kr/handle/local/14464
- 공개 및 라이선스
-
- 파일 목록
-
Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.