Perceptual relevance of the temporal envelope to the speech signal in the 4-7 kHz band

J Acoust Soc Am. 2007 Sep;122(3):EL88. doi: 10.1121/1.2761927.

Abstract

The perceptual relevance of adopting the temporal envelope to model the frequency band of 4-7 kHz (highband) in wideband speech signal is described in this letter. Based on theoretical work in psychoacoustics, we find out that the temporal envelope can indeed be a perceptual cue for the high-band signal, i.e., a noiseless sound can be obtained if the temporal envelope is roughly preserved. Subjective listening tests verify that transparent quality can be obtained if the model is used for the 4.5-7 kHz band. The proposed model has the benefits of offering flexible scalability and reducing the cost for quantization in coding applications.

Publication types

  • Letter

MeSH terms

  • Auditory Perception / physiology*
  • Hearing / physiology*
  • Humans
  • Models, Biological
  • Perceptual Masking
  • Psychoacoustics
  • Sound Spectrography
  • Speech / physiology*
  • Speech Intelligibility
  • Speech Perception / physiology*