Demographic and Symptomatic Features of Voice Disorders and Their Potential Application in Classification Using Machine Learning Algorithms

Folia Phoniatr Logop. 2018;70(3-4):174-182. doi: 10.1159/000492327. Epub 2018 Sep 5.

Abstract

Background: Previous studies have used questionnaires of dysphonic symptoms to screen for voice disorders. This study investigated whether differences in the presentation of demographic and symptomatic features can be used for computerized classification of voice disorders.

Methods: We recruited 100 patients with glottic neoplasm, 508 with phonotraumatic lesions, and 153 with unilateral vocal palsy. Statistical analyses revealed significantly different distributions of demographic and symptomatic variables across the three groups. Machine learning algorithms, including decision tree, linear discriminant analysis, K-nearest neighbors, support vector machine, and artificial neural network, were applied to classify voice disorders.
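
As a rough illustration of how one of the listed algorithms operates on this kind of input, the sketch below implements K-nearest neighbors in plain Python. It is not the authors' implementation. The feature layout (age, sex, smoking status, sudden onset, 10-item voice handicap index score), the handful of records, the min-max scaling step, and the function names are all assumptions made up for the example. The records are synthetic and are not study data.

```python
import math
from collections import Counter

# SYNTHETIC records for illustration only (not study data).
# Assumed feature order: (age, is_male, smoker, sudden_onset, vhi10_score)
TRAIN = [
    ((68, 1, 1, 0, 18), "neoplasm"),
    ((72, 1, 1, 0, 22), "neoplasm"),
    ((34, 0, 0, 0, 14), "phonotrauma"),
    ((29, 0, 0, 0, 12), "phonotrauma"),
    ((55, 1, 0, 1, 30), "palsy"),
    ((61, 0, 0, 1, 33), "palsy"),
]


def make_scaler(rows):
    """Return a function that min-max scales a feature tuple to [0, 1]."""
    columns = list(zip(*rows))
    lows = [min(c) for c in columns]
    highs = [max(c) for c in columns]

    def scale(row):
        return tuple(
            (v - lo) / (hi - lo) if hi > lo else 0.0
            for v, lo, hi in zip(row, lows, highs)
        )

    return scale


def knn_predict(train, query, k=3):
    """Predict a label by majority vote among the k nearest training records."""
    scale = make_scaler([features for features, _ in train])
    q = scale(query)
    # Sort training records by Euclidean distance in the scaled feature space.
    nearest = sorted(train, key=lambda item: math.dist(scale(item[0]), q))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


print(knn_predict(TRAIN, (70, 1, 1, 0, 20)))
print(knn_predict(TRAIN, (58, 0, 0, 1, 31)))
```

Scaling is included because age and questionnaire scores span much wider numeric ranges than the binary variables and would otherwise dominate the distance. Whether the study applied any such preprocessing is not stated in the abstract.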

Results: Demographic features were more effective for detecting neoplastic and phonotraumatic lesions, whereas symptoms were useful for detecting vocal palsy. When demographic and symptomatic variables were combined, the artificial neural network achieved the highest accuracy of 83 ± 1.58%, whereas the accuracy achieved by the other algorithms ranged from 74% to 82.6%. Decision tree analyses revealed that sex, age, smoking status, sudden onset of dysphonia, and 10-item voice handicap index scores were significant characteristics for classification.
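
The abstract does not state the validation scheme or what the ± term represents. Assuming it denotes the standard deviation of accuracy across repeated evaluation folds, a figure of this form could be computed as in the sketch below. The function names and the per-fold accuracies are hypothetical and are not taken from the study.

```python
from statistics import mean, stdev


def fold_accuracy(y_true, y_pred):
    """Percentage of predictions in one fold that match the true labels."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("label lists must be non-empty and of equal length")
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return 100.0 * correct / len(y_true)


def summarize(accuracies):
    """Return (mean, sample standard deviation) of per-fold accuracies."""
    return mean(accuracies), stdev(accuracies)


# HYPOTHETICAL per-fold accuracies in percent (not study results).
folds = [78.0, 80.0, 82.0, 84.0, 86.0]
avg, sd = summarize(folds)
print(f"{avg:.1f} ± {sd:.2f}%")
```

Reporting the spread alongside the mean indicates how much the accuracy varies from one fold to another, which matters when comparing algorithms whose mean accuracies differ by only a few percentage points.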

Conclusion: This study demonstrated significant differences in demographic and symptomatic features among glottic neoplasm, phonotraumatic lesions, and vocal palsy. These features may facilitate automatic classification of voice disorders through machine learning algorithms.

Keywords: Cyst; Larynx; Neoplasm; Nodules; Polyp; Vocal palsy.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adult
  • Age Factors
  • Aged
  • Alcohol Drinking / epidemiology
  • Algorithms
  • Demography
  • Female
  • Glottis / injuries
  • Glottis / physiopathology
  • Humans
  • Laryngeal Neoplasms / complications
  • Laryngeal Neoplasms / diagnosis
  • Laryngeal Neoplasms / physiopathology
  • Male
  • Middle Aged
  • Neural Networks, Computer*
  • Retrospective Studies
  • Severity of Illness Index
  • Sex Factors
  • Smoking / epidemiology
  • Supervised Machine Learning*
  • Symptom Assessment
  • Vocal Cord Paralysis / complications
  • Vocal Cord Paralysis / diagnosis
  • Vocal Cord Paralysis / physiopathology
  • Voice Disorders / classification*
  • Voice Disorders / epidemiology
  • Voice Quality
  • Wounds and Injuries / diagnosis