OAK

A Voice-Driven Scene-Mode Recommendation Service for Portable Digital Imaging Devices

Metadata Downloads
Abstract
In this paper, we propose a voice-driven scene-mode recommendation service in order to more easily select scene-modes on pot-table digital imaging devices such as digital cameras and camcorders. In other words, the proposed service is designed to recommend or automatically change the scene-mode by recognizing a user's voice command regarding scene or scene-related words. To realize such a service, we implement a system which is mainly composed of voice activity detection, automatic speech recognition (ASR), utterance verification, and word-to-scene-mode mapping. However, several optimization methods should be applied since portable digital imaging devices operate on embedded systems with limited resources. In addition, a speech adaptation database for acoustic models is developed such that the ASR system can adjust to the characteristics of the microphones and operating environments. Final v. the performance of the voice-driven scene-mode recommendation system is measured in terms of processing time and scene-mode recognition accuracy (SMRA). It is shown from the experiments that the average processing time and the average SMRA are around 500 ms and 98.0% for 50 scene-related words, respectively, and 1200 ms and 96.8%,for 200 scene-related words.(1)
Author(s)
Oh, Yoo RheeYoon, Jae SamKim, Hong KookKim, Myung BoKim, Sang Ryong
Issued Date
2009-11
Type
Article
URI
https://scholar.gist.ac.kr/handle/local/16929
Publisher
IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
Citation
IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, v.55, no.4, pp.1739 - 1747
ISSN
0098-3063
Appears in Collections:
Department of Electrical Engineering and Computer Science > 1. Journal Articles
공개 및 라이선스
  • 공개 구분공개
파일 목록
  • 관련 파일이 존재하지 않습니다.

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.