OAK

GIST Library Login

Metadata Downloads

Abstract: In this paper, a dysarthric speech recognition error-correction method
in a weighted finite state transducer (WFST) framework is proposed to improve
the performance of dysarthric automatic speech recognition (ASR). To this end,
pronunciation variation models are constructed from a context-dependent confusion
matrix based on a weighted Kullback-Leibler (KL) distance between triphones.
Then, a WFST is finally constructed by combining the WFST of the
baseline ASR, the constructed pronunciation variation models, a lexicon, and a
language model. It is shown from the dysarthric ASR experiments that a
WFST-based ASR system employing the proposed error-correction method
achieves relative average word error rate reduction of 19.73%, compared to an
ASR system without any error-correction method.

Appears in Collections:: Department of Electrical Engineering and Computer Science > 1. Journal Articles

공개 및 라이선스

qrcode

OAK GIST Scholar는 국립중앙도서관 OAK Repository 보급사업으로 구축되었습니다.