OAK

Initialization for NMF-based audio source separation using priors on encoding vectors

Metadata Downloads
Abstract
Nonnegative matrix factorization (NMF) has shown good performances on blind audio source separation (BASS). While the NMF analysis is a non-convex optimization problem when both the basis and encoding matrices need to be estimated simultaneously, the source separation step of the NMF-based BASS with a fixed basis matrix has been considered convex. However, because the basis matrix for the BASS is typically constructed by concatenating the basis matrices trained with individual source signals, the subspace spanned by the basis vectors for one source may overlap with that for other sources. In this paper, we have shown that the resulting encoding vector is not unique when the subspaces spanned by basis vectors for the sources overlap, which implies that the initialization of the encoding vector in the source separation stage is not trivial. Furthermore, we propose a novel method to initialize the encoding vector for the separation step based on the prior model of the encoding vector. Experimental results showed that the proposed method outperformed the uniform random initialization by 1.09 and 2.21dB in the source-to-distortion ratio, and 0.20 and 0.23 in PESQ scores for supervised and semi-supervised cases, respectively. © 2013 China Institute of Communications.
Author(s)
Byun, JacukShin, Jong Won
Issued Date
2019-09
Type
Article
DOI
10.23919/JCC.2019.09.013
URI
https://scholar.gist.ac.kr/handle/local/12551
Publisher
Editorial Department of China Communications
Citation
China Communications, v.16, no.9, pp.177 - 186
ISSN
1673-5447
Appears in Collections:
Department of Electrical Engineering and Computer Science > 1. Journal Articles
공개 및 라이선스
  • 공개 구분공개
파일 목록
  • 관련 파일이 존재하지 않습니다.

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.