Visualizing speech styles in captions for deaf and hard-of-hearing viewers

Author(s)
Ahn, SooYeon; Kim, JooYeong; Shin, Choonsung; Hong, Jin-Hyuk
Type
Article
Citation
International Journal of Human-Computer Studies, v.194
Issued Date
2025-02
Abstract
Speech styles such as extension, emphasis, and pause play an important role in capturing the audience's attention and conveying a message accurately. Unfortunately, it is challenging for Deaf and Hard-of-Hearing (DHH) people to enjoy these benefits when watching lectures with common captions. In this paper, we propose a new caption system that automatically analyzes speech styles from audio and visualizes them using visualization elements such as punctuation, paint-on, color, and boldness. We conducted a comparative study with 26 DHH viewers and found that the proposed caption system enabled them to recognize the speaker's speech style in lectures. As a result, the DHH viewers were able to watch lecture videos more vividly and were more engaged with the lectures. In particular, punctuation can be a practical solution to visualize speech styles and ensure legibility. Participants expressed a desire to use our caption system in their daily lives, providing valuable insights for future sound-visualized caption research. © 2024 Elsevier Ltd
Publisher
Academic Press
ISSN
1071-5819
DOI
10.1016/j.ijhcs.2024.103386
URI
https://scholar.gist.ac.kr/handle/local/9075
Access and License
  • Access type: Open
File List
  • No related files exist.

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.