Speech Computations of the Human Superior Temporal Gyrus

Annu Rev Psychol. 2022 Jan 4:73:79-102. doi: 10.1146/annurev-psych-022321-035256. Epub 2021 Oct 21.

Abstract

Human speech perception results from neural computations that transform external acoustic speech signals into internal representations of words. The superior temporal gyrus (STG) contains the nonprimary auditory cortex and is a critical locus for phonological processing. Here, we describe how speech sound representation in the STG relies on fundamentally nonlinear and dynamical processes, such as categorization, normalization, contextual restoration, and the extraction of temporal structure. A spatial mosaic of local cortical sites on the STG exhibits complex auditory encoding for distinct acoustic-phonetic and prosodic features. We propose that as a population ensemble, these distributed patterns of neural activity give rise to abstract, higher-order phonemic and syllabic representations that support speech perception. This review presents a multi-scale, recurrent model of phonological processing in the STG, highlighting the critical interface between auditory and language systems.

Keywords: categorization; contextual restoration; phonological processing; superior temporal gyrus; temporal landmarks.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Review

MeSH terms

  • Auditory Cortex*
  • Brain Mapping / methods
  • Humans
  • Magnetic Resonance Imaging / methods
  • Phonetics
  • Speech
  • Speech Perception*
  • Temporal Lobe