Effective dysphonia detection using feature dimension reduction and kernel density estimation for patients with Parkinson's disease

PLoS One. 2014 Feb 20;9(2):e88825. doi: 10.1371/journal.pone.0088825. eCollection 2014.

Abstract

Detection of dysphonia is useful for monitoring the progression of phonatory impairment for patients with Parkinson's disease (PD), and also helps assess the disease severity. This paper describes the statistical pattern analysis methods to study different vocal measurements of sustained phonations. The feature dimension reduction procedure was implemented by using the sequential forward selection (SFS) and kernel principal component analysis (KPCA) methods. Four selected vocal measures were projected by the KPCA onto the bivariate feature space, in which the class-conditional feature densities can be approximated with the nonparametric kernel density estimation technique. In the vocal pattern classification experiments, Fisher's linear discriminant analysis (FLDA) was applied to perform the linear classification of voice records for healthy control subjects and PD patients, and the maximum a posteriori (MAP) decision rule and support vector machine (SVM) with radial basis function kernels were employed for the nonlinear classification tasks. Based on the KPCA-mapped feature densities, the MAP classifier successfully distinguished 91.8% voice records, with a sensitivity rate of 0.986, a specificity rate of 0.708, and an area value of 0.94 under the receiver operating characteristic (ROC) curve. The diagnostic performance provided by the MAP classifier was superior to those of the FLDA and SVM classifiers. In addition, the classification results indicated that gender is insensitive to dysphonia detection, and the sustained phonations of PD patients with minimal functional disability are more difficult to be correctly identified.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Aged
  • Area Under Curve
  • Discriminant Analysis
  • Dysphonia / diagnosis*
  • Dysphonia / etiology
  • Female
  • Humans
  • Male
  • Middle Aged
  • Parkinson Disease / complications
  • Parkinson Disease / pathology*
  • Phonation / physiology*
  • Principal Component Analysis
  • ROC Curve
  • Sensitivity and Specificity
  • Support Vector Machine

Grants and funding

This work was supported by the National Natural Science Foundation of China (grant no. 81101115, 31200769, 81272168), the Natural Science Foundation of Fujian (grant no. 2011J01371), and the Fundamental Research Funds for the Central Universities of China (grant no. 2010121061). Yunfeng Wu was also supported by the 2013 Program for New Century Excellent Talents in Fujian Province University. Sridhar Krishnan was supported by the Natural Sciences and Engineering Research Council of Canada (NSERC) and the Canada Research Chairs Program. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.