Abstract: In this paper, we propose a lexicon optimization method based on a
confusability measure (CM) to reduce the decoding time of a large vocabulary
continuous speech recognition (LVCSR) system. When a lexicon is built
or expanded to cover unseen words by using grapheme-to-phoneme (G2P) conversion,
the lexicon size increases because G2P is generally realized as a 1-to-N-best
mapping. The proposed method therefore prunes confusable words from the lexicon
using a CM that is defined as a linguistic distance between two phonemic sequences.
LVCSR experiments demonstrate that the proposed lexicon
optimization method achieves a relative real-time factor reduction of 23.13% on
a Wall Street Journal task, compared with the 1-to-4-best G2P-converted
lexicon approach.
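
The pruning idea in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact form of the CM or the pruning rule, so everything below is an assumption made for illustration: the CM is taken to be a plain Levenshtein (edit) distance over phoneme symbols, the 1-best pronunciation of each word is always kept, and a lower-ranked G2P variant is dropped when it lies closer than a threshold to the 1-best pronunciation of a different word. The names `edit_distance`, `prune_lexicon`, and `min_distance`, as well as the toy lexicon, are hypothetical and do not come from the paper.

```python
from typing import Dict, List, Tuple

# A pronunciation is a sequence of phoneme symbols, e.g. ("R", "EH", "D").
Pron = Tuple[str, ...]


def edit_distance(a: Pron, b: Pron) -> int:
    """Levenshtein distance between two phoneme sequences.

    Assumption: this stands in for the paper's CM, whose exact
    definition is not given in the abstract.
    """
    prev = list(range(len(b) + 1))
    for i, pa in enumerate(a, start=1):
        curr = [i]
        for j, pb in enumerate(b, start=1):
            cost = 0 if pa == pb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def prune_lexicon(lexicon: Dict[str, List[Pron]],
                  min_distance: int) -> Dict[str, List[Pron]]:
    """Drop lower-ranked variants that are confusable with other words.

    Hypothetical rule: the first (1-best) variant of every word is kept;
    any later variant is removed if its distance to the 1-best
    pronunciation of a different word is below min_distance.
    """
    pruned: Dict[str, List[Pron]] = {}
    for word, variants in lexicon.items():
        kept = list(variants[:1])
        for variant in variants[1:]:
            confusable = any(
                edit_distance(variant, others[0]) < min_distance
                for other, others in lexicon.items()
                if other != word and others
            )
            if not confusable:
                kept.append(variant)
        pruned[word] = kept
    return pruned


# Toy 1-to-2-best lexicon (invented entries, ARPAbet-style symbols).
lexicon = {
    "read": [("R", "IY", "D"), ("R", "EH", "D")],
    "red": [("R", "EH", "D")],
    "lead": [("L", "IY", "D"), ("L", "EH", "D")],
}

pruned = prune_lexicon(lexicon, min_distance=1)
for word, variants in pruned.items():
    print(word, [" ".join(v) for v in variants])
```

With `min_distance=1`, the second variant of "read" is removed because it is identical to the 1-best pronunciation of "red", while both variants of "lead" survive. A smaller lexicon means fewer pronunciation paths for the decoder to search, which is the mechanism by which the abstract's method targets decoding time; the threshold trades lexicon size against pronunciation coverage.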

Appears in Collections: Department of Electrical Engineering and Computer Science > 1. Journal Articles


OAK GIST Scholar was built as part of the National Library of Korea's OAK Repository distribution project.