Voice modulatory cues to structure across languages and species

Philos Trans R Soc Lond B Biol Sci. 2021 Dec 20;376(1840):20200393. doi: 10.1098/rstb.2020.0393. Epub 2021 Nov 1.

Abstract

Voice modulatory cues such as variations in fundamental frequency, duration and pauses are key factors for structuring vocal signals in human speech and vocal communication in other tetrapods. Voice modulation physiology is highly similar in humans and other tetrapods due to shared ancestry and shared functional pressures for efficient communication. This has led to similarly structured vocalizations across humans and other tetrapods. Nonetheless, in their details, structural characteristics may vary across species and languages. Because data concerning voice modulation in non-human tetrapod vocal production and especially perception are relatively scarce compared to human vocal production and perception, this review focuses on voice modulatory cues used for speech segmentation across human languages, highlighting comparative data where available. Cues that are used similarly across many languages may help indicate which cues result from physiological or basic cognitive constraints, and which are employed more flexibly and shaped by cultural evolution. Such cross-linguistic comparisons suggest promising candidates for future investigation of cues to structure in non-human tetrapod vocalizations. This article is part of the theme issue 'Voice modulation: from origin and mechanism to social impact (Part I)'.

Keywords: cross-linguistic comparisons; cross-species comparisons; linguistic structure; prosody; speech segmentation; voice modulation.

Publication types

  • Research Support, Non-U.S. Gov't
  • Review

MeSH terms

  • Cues
  • Language
  • Speech
  • Speech Perception*
  • Voice*