A Voice-Driven Scene-Mode Recommendation Service for Portable Digital Imaging Devices
- Abstract
- In this paper, we propose a voice-driven scene-mode recommendation service in order to more easily select scene-modes on pot-table digital imaging devices such as digital cameras and camcorders. In other words, the proposed service is designed to recommend or automatically change the scene-mode by recognizing a user's voice command regarding scene or scene-related words. To realize such a service, we implement a system which is mainly composed of voice activity detection, automatic speech recognition (ASR), utterance verification, and word-to-scene-mode mapping. However, several optimization methods should be applied since portable digital imaging devices operate on embedded systems with limited resources. In addition, a speech adaptation database for acoustic models is developed such that the ASR system can adjust to the characteristics of the microphones and operating environments. Final v. the performance of the voice-driven scene-mode recommendation system is measured in terms of processing time and scene-mode recognition accuracy (SMRA). It is shown from the experiments that the average processing time and the average SMRA are around 500 ms and 98.0% for 50 scene-related words, respectively, and 1200 ms and 96.8%,for 200 scene-related words.(1)
- Author(s)
- Oh, Yoo Rhee; Yoon, Jae Sam; Kim, Hong Kook; Kim, Myung Bo; Kim, Sang Ryong
- Issued Date
- 2009-11
- Type
- Article
- URI
- https://scholar.gist.ac.kr/handle/local/16929
- 공개 및 라이선스
-
- 파일 목록
-
Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.