
A Smart Background Music Mixing Algorithm for Portable Digital Imaging Devices

Abstract
In this paper, we propose a smart background music (BGM) mixing algorithm for portable digital imaging devices that enables users to enjoy video content with BGM. The proposed algorithm automatically adjusts the BGM output energy based on the activity and energy of the foreground audio (FGA) contained in a video file. To this end, the algorithm classifies each segment of FGA as speech, non-speech, or a mixed signal. It then estimates a scale factor for mixing FGA and BGM according to the signal classification result and the energy of FGA. In addition, a fade-in and fade-out process is incorporated to improve the perceptual quality of the output audio at the boundaries where the signal classification changes. To demonstrate the effectiveness of the proposed algorithm, we implement it on a portable digital imaging device in real time and compare user preference for the proposed algorithm with that for conventional algorithms that mix FGA with BGM based on voice activity detection or a predefined fixed scale factor. The experiments show that the proposed algorithm is preferred over the conventional algorithms, with a preference rate of around 79%.
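The pipeline described in the abstract (classify each FGA segment, derive a BGM scale factor from the class and the FGA energy, then smooth the factor across class boundaries) can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration, not taken from the paper: the per-class gains in `BASE_GAIN`, the energy rule in `target_gain`, the `ref_energy` and `max_step` values, and all function names are hypothetical. The classifier itself is not modelled; labels are supplied as input.

```python
# Illustrative sketch only. The gain values, energy rule, and fade step are
# assumptions for demonstration, not parameters from the paper.
from typing import List, Sequence

# Hypothetical base BGM gains per FGA class (assumed values).
BASE_GAIN = {"speech": 0.2, "mixed": 0.5, "non-speech": 0.8}


def frame_energy(frame: Sequence[float]) -> float:
    """Mean squared amplitude of one frame (0.0 for an empty frame)."""
    return sum(x * x for x in frame) / len(frame) if frame else 0.0


def target_gain(label: str, energy: float, ref_energy: float = 0.1) -> float:
    """Assumed rule: reduce the class gain by up to half as FGA energy rises."""
    loudness = min(energy / ref_energy, 1.0)
    return BASE_GAIN[label] * (1.0 - 0.5 * loudness)


def smooth_gains(targets: List[float], max_step: float = 0.1) -> List[float]:
    """Limit the frame-to-frame gain change, approximating fade-in/fade-out."""
    smoothed: List[float] = []
    current = targets[0] if targets else 0.0
    for t in targets:
        delta = max(-max_step, min(max_step, t - current))
        current += delta
        smoothed.append(current)
    return smoothed


def mix(fga_frames: List[List[float]],
        bgm_frames: List[List[float]],
        labels: List[str]) -> List[List[float]]:
    """Add gain-scaled BGM to FGA, frame by frame."""
    targets = [target_gain(lab, frame_energy(f))
               for f, lab in zip(fga_frames, labels)]
    gains = smooth_gains(targets)
    return [[a + g * b for a, b in zip(f, m)]
            for f, m, g in zip(fga_frames, bgm_frames, gains)]
```

In this sketch, a jump from a non-speech frame to a speech frame does not drop the BGM gain at once; `smooth_gains` moves it toward the new target by at most `max_step` per frame, which is one simple way to avoid an audible discontinuity at a classification boundary.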
Author(s)
Kang, Jin Ah; Chun, Chan Jun; Kim, Hong Kook; Kim, Myeong Bo; Kim, Sang Ryong
Issued Date
2011-08
Type
Article
DOI
10.1109/TCE.2011.6018882
URI
https://scholar.gist.ac.kr/handle/local/16239
Publisher
Institute of Electrical and Electronics Engineers
Citation
IEEE Transactions on Consumer Electronics, v.57, no.3, pp.1258 - 1263
ISSN
0098-3063
Appears in Collections:
Department of Electrical Engineering and Computer Science > 1. Journal Articles
Access and License
  • Access type: Public
File List
  • No related files exist.

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.