Hierarchical Classification and System Combination for Automatically Identifying Physiological and Neuromuscular Laryngeal Pathologies

J Voice. 2017 May;31(3):384.e9-384.e14. doi: 10.1016/j.jvoice.2016.09.003. Epub 2016 Oct 12.

Abstract

Objectives: Speech signal processing techniques have provided several contributions to pathologic voice identification, in which healthy and unhealthy voice samples are evaluated. A less common approach is to identify laryngeal pathologies, for which the use of a noninvasive method for pathologic voice identification is an important step forward for preliminary diagnosis. In this study, a hierarchical classifier and a combination of systems are used to improve the accuracy of a three-class identification system (healthy, physiological larynx pathologies, and neuromuscular larynx pathologies).

Method: Three main subject classes were considered: subjects with physiological larynx pathologies (vocal fold nodules and edemas: 59 samples), subjects with neuromuscular larynx pathologies (unilateral vocal fold paralysis: 59 samples), and healthy subjects (36 samples). The variables used in this study were a speech task (sustained vowel /a/ or continuous reading speech), features with or without perceptual information, and features with or without direct information about formants evaluated using single classifiers. A hierarchical classification system was designed based on this information.

Results: The resulting system combines an analysis of continuous speech by way of the commonly used sustained vowel /a/ to obtain spectral and perceptual speech features. It achieved an accuracy of 84.4%, which represents an improvement of approximately 9% compared with the stand-alone approach. For pathologic voice identification, the accuracy obtained was 98.7%, and the identification accuracy for the two pathology classes was 81.3%.

Conclusions: Hierarchical classification and system combination create significant benefits and introduce a modular approach to the classification of larynx pathologies.

Keywords: Continuous speech; Hierarchical classification; Larynx pathology identification; Pathologic voice identification; Sustained vowel.

MeSH terms

  • Case-Control Studies
  • Databases, Factual
  • Diagnosis, Computer-Assisted / methods*
  • Edema / classification
  • Edema / diagnosis*
  • Edema / pathology
  • Edema / physiopathology
  • Female
  • Humans
  • Male
  • Pattern Recognition, Automated
  • Predictive Value of Tests
  • Signal Processing, Computer-Assisted*
  • Speech Acoustics
  • Speech Production Measurement / methods*
  • Support Vector Machine*
  • Vocal Cord Paralysis / classification
  • Vocal Cord Paralysis / diagnosis*
  • Vocal Cord Paralysis / pathology
  • Vocal Cord Paralysis / physiopathology
  • Vocal Cords* / pathology
  • Vocal Cords* / physiopathology
  • Voice Disorders / classification
  • Voice Disorders / diagnosis*
  • Voice Disorders / pathology
  • Voice Disorders / physiopathology
  • Voice Quality