Formant frequency estimation of high-pitched vowels using weighted linear prediction

J Acoust Soc Am. 2013 Aug;134(2):1295-313. doi: 10.1121/1.4812756.

Abstract

All-pole modeling is a widely used formant estimation method, but its performance is known to deteriorate for high-pitched voices. In order to address this problem, several all-pole modeling methods robust to fundamental frequency have been proposed. This study compares five such previously known methods and introduces a technique, Weighted Linear Prediction with Attenuated Main Excitation (WLP-AME). WLP-AME utilizes temporally weighted linear prediction (LP) in which the square of the prediction error is multiplied by a given parametric weighting function. The weighting downgrades the contribution of the main excitation of the vocal tract in optimizing the filter coefficients. Consequently, the resulting all-pole model is affected more by the characteristics of the vocal tract leading to less biased formant estimates. By using synthetic vowels created with a physical modeling approach, the results showed that WLP-AME yields improved formant frequencies for high-pitched sounds in comparison to the previously known methods (e.g., relative error in the first formant of the vowel [a] decreased from 11% to 3% when conventional LP was replaced with WLP-AME). Experiments conducted on natural vowels indicate that the formants detected by WLP-AME changed in a more regular manner between repetitions of different pitch than those computed by conventional LP.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adult
  • Algorithms
  • Biomechanical Phenomena
  • Child, Preschool
  • Computer Simulation
  • Female
  • Glottis / anatomy & histology
  • Glottis / physiology*
  • Humans
  • Linear Models*
  • Male
  • Numerical Analysis, Computer-Assisted
  • Pattern Recognition, Automated
  • Phonation*
  • Phonetics*
  • Pitch Perception*
  • Pressure
  • Signal Processing, Computer-Assisted
  • Sound Spectrography
  • Speech Acoustics*
  • Speech Production Measurement
  • Time Factors
  • Vocal Cords / physiology
  • Voice Quality*