OAK

Efficient distribution of feature parameters for speech recognition in network environments

Metadata Downloads
Author(s)
Yoon, JSLee, GHKim, Hong Kook
Type
Article
Citation
ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2005, PT 1, v.3767, pp.477 - 488
Issued Date
2005-11
Abstract
In network or ubiquitous environments, there are difficulties in performing large vocabulary speech recognition by a small device due to its limited power. Therefore, an approach, so-called distributed speech recognition (DSR), that; distributes the processing modules of automatic speech recognition into a device and a server has been attractive. Of all processing modules of DSR, quantization of feature parameters plays a main role in terms of the transmission bandwidth and the recognition performance. In this paper, we propose an efficient quantizer of feature parameters by incorporating the correlation between successive analysis frames of speech. The proposed quantizer is based on the predictive multi-stage vector quantization scheme and designed with different bit rates by trading off with the performance of speech recognition. It is shown from speech recognition experiments that the DSR system employing the proposed quantization method can reduce a bit rate by 20% with a comparable recognition performance to the ETSI DSR standard.
Publisher
SPRINGER-VERLAG BERLIN
ISSN
0302-9743
URI
https://scholar.gist.ac.kr/handle/local/18015
공개 및 라이선스
  • 공개 구분공개
파일 목록
  • 관련 파일이 존재하지 않습니다.

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.