Deep learning reveals what vocal bursts express in different cultures

Nat Hum Behav. 2023 Feb;7(2):240-250. doi: 10.1038/s41562-022-01489-2. Epub 2022 Dec 28.

Abstract

Human social life is rich with sighs, chuckles, shrieks and other emotional vocalizations, called 'vocal bursts'. Nevertheless, the meaning of vocal bursts across cultures is only beginning to be understood. Here, we combined large-scale experimental data collection with deep learning to reveal the shared and culture-specific meanings of vocal bursts. A total of 4,031 participants in China, India, South Africa, the USA and Venezuela mimicked vocal bursts drawn from 2,756 seed recordings. Participants also judged the emotional meaning of each vocal burst. A deep neural network, tasked with predicting the culture-specific meanings people attributed to vocal bursts while disregarding context and speaker identity, discovered 24 acoustic dimensions, or kinds, of vocal expression with distinct emotion-related meanings. The meanings attributed to these complex vocal modulations were 79% preserved across the five countries and three languages. These results reveal the underlying dimensions of human emotional vocalization in remarkable detail.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Acoustics
  • Deep Learning*
  • Emotions
  • Humans
  • Language
  • Voice*