Substituting facial movements in singers changes the sounds of musical intervals

Sci Rep. 2021 Nov 17;11(1):22442. doi: 10.1038/s41598-021-01797-z.

Abstract

Cross-modal integration is ubiquitous within perception and, in humans, the McGurk effect demonstrates that seeing a person articulating speech can change what we hear into a new auditory percept. It remains unclear whether cross-modal integration of sight and sound generalizes to other visible vocal articulations like those made by singers. We surmise that perceptual integrative effects should involve music deeply, since there is ample indeterminacy and variability in its auditory signals. We show that switching videos of sung musical intervals changes systematically the estimated distance between two notes of a musical interval so that pairing the video of a smaller sung interval to a relatively larger auditory led to compression effects on rated intervals, whereas the reverse led to a stretching effect. In addition, after seeing a visually switched video of an equally-tempered sung interval and then hearing the same interval played on the piano, the two intervals were judged often different though they differed only in instrument. These findings reveal spontaneous, cross-modal, integration of vocal sounds and clearly indicate that strong integration of sound and sight can occur beyond the articulations of natural speech.

MeSH terms

  • Acoustic Stimulation / methods
  • Adolescent
  • Adult
  • Auditory Perception / physiology*
  • Facial Muscles / physiology*
  • Female
  • Hearing / physiology
  • Humans
  • Male
  • Movement / physiology*
  • Music / psychology*
  • Singing / physiology*
  • Sound*
  • Speech
  • Speech Perception / physiology
  • Students / psychology
  • Voice / physiology*
  • Young Adult