You Described, We Archived: A Rich Audio Description Dataset

Charity Pitcher-Cooper; Manali Seth; Benjamin Kao; James M Coughlan; Ilmi Yoon

You Described, We Archived: A Rich Audio Description Dataset

J Technol Pers Disabil. 2023 May:11:192-208. Epub 2024 Jan 19.

Authors

Charity Pitcher-Cooper¹, Manali Seth², Benjamin Kao², James M Coughlan¹, Ilmi Yoon²

Affiliations

¹ Smith-Kettlewell Eye Research Institute.
² Computer Science, San Francisco State University.

PMID: 38516032
PMCID: PMC10956524

Abstract

The You Described, We Archived dataset (YuWA) is a collaboration between San Francisco State University and The Smith-Kettlewell Eye Research Institute. It includes audio description (AD) data collected worldwide 2013-2022 through YouDescribe, an accessibility tool for adding audio descriptions to YouTube videos. YouDescribe, a web-based audio description tool along with an iOS viewing app, has a community of 12,000+ average annual visitors, with approximately 3,000 volunteer describers, and has created over 5,500 audio described YouTube videos. Blind and visually impaired (BVI) viewers request videos, which then are saved to a wish list and volunteer audio describers select a video, write a script, record audio clips, and edit clip placement to create an audio description. The AD tracks are stored separately, posted for public view at https://youdescribe.org/ and played together with the YouTube video. The YuWA audio description data paired with the describer and viewer metadata, and collection timeline has a large number of research applications including artificial intelligence, machine learning, sociolinguistics, audio description, video understanding, video retrieval and video-language grounding tasks.

Keywords: Artificial Intelligence; Audio Description; Blind and Low Vision; Machine Learning; Sociolinguistics; Video Accessibility.

Abstract

Grants and funding