Time-resolved discrimination of audio-visual emotion expressions

Cortex. 2019 Oct:119:184-194. doi: 10.1016/j.cortex.2019.04.017. Epub 2019 May 6.

Abstract

Humans seamlessly extract and integrate the emotional content delivered by the face and the voice of others. It is, however, poorly understood how perceptual decisions unfold in time when people discriminate the expression of emotions transmitted through dynamic facial and vocal signals, as in natural social contexts. In this study, we relied on a gating paradigm to track how the recognition of emotion expressions across the senses unfolds over exposure time. We first demonstrate that, across all emotions tested, a discriminatory decision is reached earlier with faces than with voices. Importantly, multisensory stimulation consistently reduced the amount of perceptual evidence needed to reach correct discrimination (isolation point). We also observed that expressions with different emotional content provide cumulative evidence at different speeds, with "fear" being the expression with the fastest isolation point across the senses. Finally, the lack of correlation between the confusion patterns in response to facial and vocal signals across time suggests distinct relations between the discriminative features extracted from the two signals. Altogether, these results provide a comprehensive view of how auditory, visual and audiovisual information related to different emotion expressions accumulates in time, highlighting how multisensory context can speed up the discrimination process when minimal information is available.
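As a rough illustration of the isolation-point measure used in the gating paradigm, the sketch below (Python) computes the shortest exposure duration from which a participant's responses remain correct for all longer gates. The function name, gate durations, and response labels are invented for illustration and are not taken from the study's materials or data.

```python
# Hypothetical sketch: in a gating paradigm, stimuli are presented in
# progressively longer "gates". The isolation point is the shortest gate
# duration at which the response is correct and stays correct for every
# longer gate. Durations and labels below are invented examples.

from typing import Optional, Sequence


def isolation_point(gate_durations_ms: Sequence[int],
                    responses: Sequence[str],
                    target: str) -> Optional[int]:
    """Return the shortest gate duration (ms) from which all subsequent
    responses match the target emotion, or None if they never stabilise."""
    assert len(gate_durations_ms) == len(responses)
    ip = None
    for duration, response in zip(gate_durations_ms, responses):
        if response == target:
            if ip is None:
                ip = duration          # candidate isolation point
        else:
            ip = None                  # a later error resets the candidate
    return ip


# Example: labelling gated presentations of a "fear" expression.
gates = [100, 200, 300, 400, 500, 600]   # exposure durations in ms
labels = ["surprise", "surprise", "fear", "fear", "fear", "fear"]
print(isolation_point(gates, labels, target="fear"))   # -> 300
```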

Keywords: Emotions; Face; Gating; Multisensory; Voice.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adolescent
  • Adult
  • Emotions / physiology*
  • Expressed Emotion / physiology
  • Facial Expression
  • Female
  • Humans
  • Male
  • Photic Stimulation / methods
  • Recognition, Psychology / physiology*
  • Time Factors*
  • Voice / physiology*
  • Young Adult