OAK

Cross-Corpus Speech Emotion Recognition Based on Few-Shot Learning and Domain Adaptation

Metadata Downloads
Abstract
Within a single speech emotion corpus, deep neural networks have shown decent performance in speech emotion recognition. However, the performance of the emotion recognition based on data-driven learning methods degrades significantly for the cross-corpus scenario. To relieve this issue without any labeled samples from the target domain, we propose a cross-corpus speech emotion recognition based on few-shot learning and unsupervised domain adaptation, which is trained to learn the class (emotion) similarity from the source domain samples adapted to the target domain. In addition, we utilize multiple corpora in training to enhance the robustness of the emotion recognition to the unseen samples. Experiments on emotional speech corpora with three different languages showed that the proposed method outperformed other approaches.
Author(s)
Ahn, YoungdoLee, Sung JooShin, Jong Won
Issued Date
2021-06
Type
Article
DOI
10.1109/LSP.2021.3086395
URI
https://scholar.gist.ac.kr/handle/local/11460
Publisher
IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
Citation
IEEE SIGNAL PROCESSING LETTERS, v.28, pp.1190 - 1194
ISSN
1070-9908
Appears in Collections:
Department of Electrical Engineering and Computer Science > 1. Journal Articles
공개 및 라이선스
  • 공개 구분공개
파일 목록
  • 관련 파일이 존재하지 않습니다.

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.