Robustness of auditory Teager Energy Cepstrum Coefficients for classification of pathological and normal voices in noisy environments

Lotfi Salhi; Adnane Cherif

doi:10.1155/2013/435729

Robustness of auditory Teager Energy Cepstrum Coefficients for classification of pathological and normal voices in noisy environments

ScientificWorldJournal. 2013 May 28:2013:435729. doi: 10.1155/2013/435729. Print 2013.

Authors

Lotfi Salhi¹, Adnane Cherif

Affiliation

¹ Signal Processing Laboratory, Physics Department, Sciences Faculty of Tunis, University of Tunis ElManar, 1060 Tunis, Tunisia. lotfi.salhi@laposte.net

Abstract

This paper focuses on a robust feature extraction algorithm for automatic classification of pathological and normal voices in noisy environments. The proposed algorithm is based on human auditory processing and the nonlinear Teager-Kaiser energy operator. The robust features which labeled Teager Energy Cepstrum Coefficients (TECCs) are computed in three steps. Firstly, each speech signal frame is passed through a Gammatone or Mel scale triangular filter bank. Then, the absolute value of the Teager energy operator of the short-time spectrum is calculated. Finally, the discrete cosine transform of the log-filtered Teager Energy spectrum is applied. This feature is proposed to identify the pathological voices using a developed neural system of multilayer perceptron (MLP). We evaluate the developed method using mixed voice database composed of recorded voice samples from normophonic or dysphonic speakers. In order to show the robustness of the proposed feature in detection of pathological voices at different White Gaussian noise levels, we compare its performance with results for clean environments. The experimental results show that TECCs computed from Gammatone filter bank are more robust in noisy environments than other extracted features, while their performance is practically similar to clean environments.

MeSH terms

Adult
Aged
Aged, 80 and over
Algorithms
Diagnosis, Computer-Assisted / methods*
Female
Humans
Male
Middle Aged
Noise*
Pattern Recognition, Automated / methods*
Reproducibility of Results
Sensitivity and Specificity
Sound Spectrography / methods*
Speech Recognition Software*
Voice Disorders / diagnosis*
Voice Disorders / physiopathology*
Voice Quality