Word-timestamped transcripts of two spoken narrative recall functional neuroimaging datasets

Savannah J Born; Kathy Shi; Haemy Lee Masson; Hongmi Lee; Yoonjung Lee; Janice Chen

doi:10.1016/j.dib.2023.109490

Data Brief. 2023 Aug 9:50:109490. doi: 10.1016/j.dib.2023.109490. eCollection 2023 Oct.

Authors

Savannah J Born¹,², Kathy Shi¹, Haemy Lee Masson³, Hongmi Lee¹,⁴, Yoonjung Lee¹, Janice Chen¹

Affiliations

¹ Department of Psychological and Brain Sciences, Johns Hopkins University, Baltimore, Maryland, United States.
² Department of Psychological & Brain Sciences, Washington University in St. Louis, St. Louis, MO, United States.
³ Department of Psychology, Durham University, Durham, United Kingdom.
⁴ Department of Psychological Sciences, Purdue University, West Lafayette, IN, United States.

Abstract

After watching audiovisual movies, human participants produced spoken narrative recollections during functional magnetic resonance imaging (fMRI); presented here are word-level timestamps of their speech, temporally aligned to the publicly shared fMRI data. For the "FilmFestival" dataset, twenty participants watched ten short audiovisual movies, approximately 2-8 minutes each. For the "Sherlock" dataset, seventeen participants watched the first half of the first episode of BBC's Sherlock (48 minutes). After viewing, participants verbally described what they remembered about the movies in their own words. Participants' speech was recorded using an MR-compatible microphone. The audio recordings were transcribed, then timestamped by a forced aligner; missing timestamps were filled in manually by human transcriptionists referencing the audio recording. Each file contains the participant's recall word by word, the onset of each word in seconds with 1/10th-second precision, and the corresponding fMRI volume number (TR). This dataset can be used to investigate topics such as naturalistic memory and language production.
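The relationship between a word onset in seconds and an fMRI volume number can be illustrated with a minimal sketch. Everything specific here is an assumption rather than something stated in the abstract: the function name `onset_to_tr` is hypothetical, the 1.5-second repetition time is only a placeholder default, and volumes are assumed to be numbered from 1 with each word assigned to the volume during which it begins. The dataset's own documentation should be consulted for the actual TR duration and indexing convention.

```python
def onset_to_tr(onset_s: float, tr_s: float = 1.5) -> int:
    """Return the 1-based volume number containing a word onset.

    Assumptions (not taken from the source): tr_s is a placeholder
    repetition time, volumes are numbered from 1, and a word belongs
    to the volume during which it starts.
    """
    if onset_s < 0:
        raise ValueError("onset must be non-negative")
    if tr_s <= 0:
        raise ValueError("TR must be positive")
    # Work in whole milliseconds so that onsets given to the nearest
    # tenth of a second are not shifted by floating-point rounding.
    onset_ms = round(onset_s * 1000)
    tr_ms = round(tr_s * 1000)
    return onset_ms // tr_ms + 1


# Hypothetical onsets in seconds, at 1/10th-second precision.
example_onsets = [0.0, 1.4, 1.5, 3.0]
example_trs = [onset_to_tr(t) for t in example_onsets]
```

Integer arithmetic is used only to keep onsets that fall exactly on a volume boundary from landing in the wrong volume; the volume numbers provided in the files themselves remain the authoritative alignment.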

Keywords: Language; Memory; Naturalistic; Recollection; Speech production; fMRI.

Grants and funding

R01 MH133732/MH/NIMH NIH HHS/United States