Hybrid Machine Learning Framework for Multistage Parkinson's Disease Classification Using Acoustic Features of Sustained Korean Vowels

S I M M Raton Mondol; Ryul Kim; Sangmin Lee

doi:10.3390/bioengineering10080984

Bioengineering (Basel). 2023 Aug 20;10(8):984. doi: 10.3390/bioengineering10080984.

Authors

S I M M Raton Mondol¹, Ryul Kim², Sangmin Lee¹

Affiliations

¹ Department of Electrical and Computer Engineering, Inha University, Incheon 22212, Republic of Korea.
² Department of Neurology, Inha University Hospital, Inha University College of Medicine, Incheon 22212, Republic of Korea.

Abstract

Recent research has achieved high classification accuracy in separating healthy people from those with Parkinson's disease (PD) using speech and voice. However, these studies have primarily treated the early and advanced stages of PD as a single entity, neglecting the distinctive speech impairments and other symptoms that vary across the stages of the disease. To address this limitation and improve diagnostic precision, this study assesses selected acoustic features of dysphonia as they relate to PD and the Hoehn and Yahr stages, combining various preprocessing techniques with multiple classification algorithms to create a comprehensive and robust classification solution. The dysphonia features extracted from the three sustained Korean vowels /아/(a), /이/(i), and /우/(u) are diverse and strongly correlated. To address this issue, the analysis of variance (ANOVA) F-value feature selection method from scikit-learn was employed to identify the most relevant features. Additionally, to overcome the class imbalance problem, the synthetic minority over-sampling technique (SMOTE) was used. To ensure fair comparisons and mitigate the influence of any individual classifier, four commonly used machine learning classifiers were employed: random forest (RF), support vector machine (SVM), k-nearest neighbor (kNN), and multi-layer perceptron (MLP). This approach enables a comprehensive evaluation of the feature extraction methods and minimizes the variance in the final classification models. The proposed hybrid machine learning pipeline, using the acoustic features of sustained vowels, detects the early and mid-advanced stages of PD with an accuracy of 95.48%, and achieves accuracies of 86.62% for 4-stage and 89.48% for 3-stage classification of PD. This study demonstrates the significance of using the diverse acoustic features of dysphonia in the classification of PD and its stages.

Keywords: dysphonia features; machine learning classifiers; multistage Parkinson’s disease; voice biomarkers.
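
The two preprocessing ideas named in the abstract, ANOVA F-value feature ranking and minority over-sampling by interpolation, can be illustrated with a minimal pure-Python sketch. This is not the authors' code: the function names (`anova_f_value`, `select_top_k`, `smote_like`), the toy data, and the neighbor count are all assumptions made for illustration, and the over-sampler is a simplified stand-in for SMOTE rather than a full implementation.

```python
import random
from statistics import mean


def anova_f_value(values, labels):
    """One-way ANOVA F statistic of a single feature across class labels."""
    groups = {}
    for v, lab in zip(values, labels):
        groups.setdefault(lab, []).append(v)
    n, k = len(values), len(groups)
    grand = mean(values)
    # Variation of the class means around the grand mean.
    ss_between = sum(len(g) * (mean(g) - grand) ** 2 for g in groups.values())
    # Variation of samples around their own class mean.
    ss_within = sum((v - mean(g)) ** 2 for g in groups.values() for v in g)
    if ss_within == 0:
        return float("inf") if ss_between > 0 else 0.0
    return (ss_between / (k - 1)) / (ss_within / (n - k))


def select_top_k(X, y, k):
    """Return indices of the k highest-scoring features, plus all scores."""
    n_features = len(X[0])
    scores = [anova_f_value([row[j] for row in X], y) for j in range(n_features)]
    ranked = sorted(range(n_features), key=lambda j: scores[j], reverse=True)
    return ranked[:k], scores


def smote_like(X, y, minority, n_new, n_neighbors=2, seed=0):
    """Synthesize n_new minority samples by interpolating toward a near neighbor."""
    rng = random.Random(seed)
    pts = [row for row, lab in zip(X, y) if lab == minority]
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(pts)
        # Other minority samples, nearest first (squared Euclidean distance).
        others = sorted(
            (p for p in pts if p is not base),
            key=lambda p: sum((a - b) ** 2 for a, b in zip(base, p)),
        )
        neighbor = rng.choice(others[:n_neighbors])
        gap = rng.random()  # position along the segment base -> neighbor
        synthetic.append([a + gap * (b - a) for a, b in zip(base, neighbor)])
    return synthetic


# Invented toy data: feature 0 separates the classes, feature 1 does not.
X = [
    [1.0, 5.0], [1.2, 3.0], [0.9, 4.0], [1.1, 6.0],  # class 0 (majority)
    [3.0, 4.5], [3.2, 5.5],                          # class 1 (minority)
]
y = [0, 0, 0, 0, 1, 1]

top, scores = select_top_k(X, y, k=1)
synthetic = smote_like(X, y, minority=1, n_new=2)
print("selected feature indices:", top)
print("synthetic minority samples:", synthetic)
```

In practice, scikit-learn's `f_classif` scoring function used with `SelectKBest` computes this kind of per-feature F-value, and over-sampling should be applied only to training data so that synthetic samples do not leak into the evaluation set.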