Fluidity in the perception of auditory speech: Cross-modal recalibration of voice gender and vowel identity by a talking face

Merel A Burgering; Thijs van Laarhoven; Martijn Baart; Jean Vroomen

doi:10.1177/1747021819900884

Fluidity in the perception of auditory speech: Cross-modal recalibration of voice gender and vowel identity by a talking face

Q J Exp Psychol (Hove). 2020 Jun;73(6):957-967. doi: 10.1177/1747021819900884. Epub 2020 Jan 30.

Authors

Merel A Burgering¹, Thijs van Laarhoven¹, Martijn Baart^{1

2}, Jean Vroomen¹

Affiliations

¹ Department of Cognitive Neuropsychology, Tilburg University, Tilburg, The Netherlands.
² BCBL-Basque Center on Cognition, Brain and Language, Donostia-San Sebastián, Spain.

PMID: 31931664
DOI: 10.1177/1747021819900884

Abstract

Humans quickly adapt to variations in the speech signal. Adaptation may surface as recalibration, a learning effect driven by error-minimisation between a visual face and an ambiguous auditory speech signal, or as selective adaptation, a contrastive aftereffect driven by the acoustic clarity of the sound. Here, we examined whether these aftereffects occur for vowel identity and voice gender. Participants were exposed to male, female, or androgynous tokens of speakers pronouncing /e/, /ø/, (embedded in words with a consonant-vowel-consonant structure), or an ambiguous vowel halfway between /e/ and /ø/ dubbed onto the video of a male or female speaker pronouncing /e/ or /ø/. For both voice gender and vowel identity, we found assimilative aftereffects after exposure to auditory ambiguous adapter sounds, and contrastive aftereffects after exposure to auditory clear adapter sounds. This demonstrates that similar principles for adaptation in these dimensions are at play.

Keywords: Audiovisual integration; gender; recalibration; selective adaptation; vowel.

MeSH terms

Adult
Facial Recognition / physiology*
Female
Humans
Male
Sex Factors
Social Perception*
Speech Perception / physiology*
Young Adult