A mobile-assisted voice condition analysis system for Parkinson's disease: assessment of usability conditions

Javier Carrón; Yolanda Campos-Roca; Mario Madruga; Carlos J Pérez

doi:10.1186/s12938-021-00951-y

A mobile-assisted voice condition analysis system for Parkinson's disease: assessment of usability conditions

Biomed Eng Online. 2021 Nov 21;20(1):114. doi: 10.1186/s12938-021-00951-y.

Authors

Javier Carrón¹, Yolanda Campos-Roca², Mario Madruga¹, Carlos J Pérez³

Affiliations

¹ Departamento de Matemáticas, Universidad de Extremadura, Cáceres, Spain.
² Departamento de Tecnología de los Computadores y las Comunicaciones, Universidad de Extremadura, Cáceres, Spain.
³ Departamento de Matemáticas, Universidad de Extremadura, Cáceres, Spain. carper@unex.es.

Abstract

Background and objective: Automatic voice condition analysis systems to detect Parkinson's disease (PD) are generally based on speech data recorded under acoustically controlled conditions and professional supervision. The performance of these approaches in a free-living scenario is unknown. The aim of this research is to investigate the impact of uncontrolled conditions (realistic acoustic environment and lack of supervision) on the performance of automatic PD detection systems based on speech.

Methods: A mobile-assisted voice condition analysis system is proposed to aid in the detection of PD using speech. The system is based on a server-client architecture. In the server, feature extraction and machine learning algorithms are designed and implemented to discriminate subjects with PD from healthy ones. The Android app allows patients to submit phonations and physicians to check the complete record of every patient. Six different machine learning classifiers are applied to compare their performance on two different speech databases. One of them is an in-house database (UEX database), collected under professional supervision by using the same Android-based smartphone in the same room, whereas the other one is an age, sex and health-status balanced subset of mPower study for PD, which provides real-world data. By applying identical methodology, single-database experiments have been performed on each database, and also cross-database tests. Cross-validation has been applied to assess generalization performance and hypothesis tests have been used to report statistically significant differences.

Results: In the single-database experiments, a best accuracy rate of 0.92 (AUC = 0.98) has been obtained on UEX database, while a considerably lower best accuracy rate of 0.71 (AUC = 0.76) has been achieved using the mPower-based database. The cross-database tests provided very degraded accuracy metrics.

Conclusion: The results clearly show the potential of the proposed system as an aid for general practitioners to conduct triage or an additional tool for neurologists to perform diagnosis. However, due to the performance degradation observed using data from mPower study, semi-controlled conditions are encouraged, i.e., voices recorded at home by the patients themselves following a strict recording protocol and control of the information about patients by the medical doctor at charge.

Keywords: Acoustic features; Machine learning; Parkinson’s disease; Speech processing; Voice condition analysis system; mPower database.

MeSH terms

Algorithms
Humans
Machine Learning
Parkinson Disease* / diagnosis
Smartphone
Speech

Abstract

MeSH terms

Grants and funding