µCap: Instrumental Music Captions for Deaf and Hard-of-Hearing Individuals

Author(s)
Ahn, SooYeon; Baek, In-Chang; Kim, KyungJoong; Truong, Khai N.; Hong, Jin-Hyuk
Type
Conference Paper
Citation
CHI 2026: CHI Conference on Human Factors in Computing Systems, pp. 1-26
Issued Date
2026-04-13
Abstract
Instrumental music conveys rich affective experiences through acoustic cues, yet instrumental passages often remain inaccessible to Deaf and Hard-of-Hearing (DHH) audiences. Although captioning practices for vocal songs have expanded, instrumental music remains largely uncaptioned, with no established criteria for representing musical content in text. We propose μCap (Music Captions), an automatic instrumental music captioning system that transforms instrumental audio into time-aligned, non-lexical textual renderings enhanced with simple visuals. Drawing on preliminary surveys with DHH individuals and expert group discussions, we developed a phonetic-like captioning schema grounded in music sound analysis and linguistics. We then implemented μCap using audio feature extraction and a retrieval-augmented generation pipeline to produce expressive, sound-mimetic captions. Two user evaluations with DHH participants (n=20 and n=15) showed that μCap enhanced music appreciation, immersion, and perceived presence of acoustic detail. This work contributes empirical evidence and insights for designing caption-based visual representations that make instrumental music more accessible. © 2026 Copyright held by the owner/author(s).
Publisher
ACM
Conference Place
Barcelona, Spain
URI
https://scholar.gist.ac.kr/handle/local/34174
Access and License
  • Access type: Open
File List
  • No related files exist.

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.