Voice data mining for laryngeal pathology assessment

Daria Hemmerling; Andrzej Skalski; Janusz Gajda

doi:10.1016/j.compbiomed.2015.07.026

Voice data mining for laryngeal pathology assessment

Comput Biol Med. 2016 Feb 1:69:270-6. doi: 10.1016/j.compbiomed.2015.07.026. Epub 2015 Aug 10.

Authors

Daria Hemmerling¹, Andrzej Skalski², Janusz Gajda²

Affiliations

¹ AGH University of Science and Technology, Department of Measurement and Electronics, Al. Mickiewicza 30, 30-059 Krakow, Poland. Electronic address: dpanek@agh.edu.pl.
² AGH University of Science and Technology, Department of Measurement and Electronics, Al. Mickiewicza 30, 30-059 Krakow, Poland.

PMID: 26471193
DOI: 10.1016/j.compbiomed.2015.07.026

Abstract

The aim of this study was to evaluate the usefulness of different methods of speech signal analysis in the detection of voice pathologies. Firstly, an initial vector was created consisting of 28 parameters extracted from time, frequency and cepstral domain describing the human voice signal based on the analysis of sustained vowels /a/, /i/ and /u/ all at high, low and normal pitch. Afterwards we used a linear feature extraction technique (principal component analysis), which enabled a reduction in the number of parameters and choose the most effective acoustic features describing the speech signal. We have also performed non-linear data transformation which was calculated using kernel principal components. The results of the presented methods for normal and pathological cases will be revealed and discussed in this paper. The initial and extracted feature vectors were classified using the k-means clustering and the random forest classifier. We found that reasonably good classification accuracies could be achieved by selecting appropriate features. We obtained accuracies of up to 100% for classification of healthy versus pathology voice using random forest classification for female and male recordings. These results may assist in the feature development of automated detection systems for diagnosis of patients with symptoms of pathological voice.

Keywords: Acoustic analysis; Feature selection; PCA; Random forest; Voice pathology detection; kPCA.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Data Mining / methods*
Female
Humans
Laryngeal Diseases / diagnosis*
Laryngeal Diseases / physiopathology*
Male
Signal Processing, Computer-Assisted*
Voice*