Pathological speech signal analysis and classification using empirical mode decomposition

Med Biol Eng Comput. 2013 Jul;51(7):811-21. doi: 10.1007/s11517-013-1051-8. Epub 2013 Mar 5.

Abstract

Automated classification of normal and pathological speech signals can provide an objective and accurate mechanism for pathological speech diagnosis, and is an active area of research. A large part of this research is based on analysis of acoustic measures extracted from sustained vowels. However, sustained vowels do not reflect real-world attributes of voice as effectively as continuous speech, which can take into account important attributes of speech such as rapid voice onset and termination, changes in voice frequency and amplitude, and sudden discontinuities in speech. This paper presents a methodology based on empirical mode decomposition (EMD) for classification of continuous normal and pathological speech signals obtained from a well-known database. EMD is used to decompose randomly chosen portions of speech signals into intrinsic mode functions, which are then analyzed to extract meaningful temporal and spectral features, including true instantaneous features which can capture discriminative information in signals hidden at local time-scales. A total of six features are extracted, and a linear classifier is used with the feature vector to classify continuous speech portions obtained from a database consisting of 51 normal and 161 pathological speakers. A classification accuracy of 95.7 % is obtained, thus demonstrating the effectiveness of the methodology.

MeSH terms

  • Algorithms*
  • Humans
  • Signal Processing, Computer-Assisted*
  • Speech
  • Speech Acoustics
  • Speech Disorders / diagnosis*
  • Time Factors