Multiband analysis and synthesis of spectro-temporal modulations of Fourier spectrogram

J Acoust Soc Am. 2011 May;129(5):EL190-6. doi: 10.1121/1.3565471.

Abstract

The two-dimensional spectro-temporal modulation filtering concept of the auditory model [T. Chi, P. Ru, and S. A. Shamma, J. Acoust. Soc. Am. 118(2), 887-906 (2005)] is implemented on the Fourier spectrogram. The Fourier magnitude spectrogram is analyzed in terms of its joint spectro-temporal modulations, which embed the temporal dynamics and spectral structures. Instead of iterative projection methods, the overlap-and-add method is adopted to invert modified Fourier spectrograms back to sounds. The proposed framework not only provides a similar spectro-temporal analytical process for sounds as the auditory model but also produces synthesized sounds with better quality in a timely manner, which makes proposed framework feasible to human speech recognition (HSR) applications as well.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Auditory Cortex / physiology
  • Communication Aids for Disabled
  • Female
  • Fourier Analysis*
  • Humans
  • Male
  • Models, Neurological
  • Phonetics
  • Sensory Receptor Cells / physiology
  • Sound Spectrography / methods*
  • Sound Spectrography / statistics & numerical data
  • Speech Intelligibility
  • Speech Recognition Software*