Word-timestamped transcripts of two spoken narrative recall functional neuroimaging datasets

Data Brief. 2023 Aug 9:50:109490. doi: 10.1016/j.dib.2023.109490. eCollection 2023 Oct.

Abstract

After watching audiovisual movies, human participants produced spoken narrative recollections during functional magnetic resonance imaging (fMRI); presented here are word-level timestamps of their speech, temporally aligned to the publicly shared fMRI data. For the "FilmFestival" dataset, twenty participants watched ten short audiovisual movies, approximately 2-8 minutes each. For the "Sherlock" dataset, seventeen participants watched the first half of the first episode of BBC's Sherlock (48 minutes). After viewing, participants then verbally described what they remembered about the movies in their own words. Participants' speech was recorded using an MR-compatible microphone. The audio recordings were transcribed, then timestamped by a forced aligner; missing timestamps were filled in manually by human transcriptionists referencing the audio recording. Each file contains the participant's recall word by word, onset of each word in seconds with 1/10th-second precision, and the corresponding fMRI volume number (TR). This dataset can be used to investigate topics such as naturalistic memory and language production.

Keywords: Language; Memory; Naturalistic; Recollection; Speech production; fMRI.