Data-Driven Based Approach to Aid Parkinson's Disease Diagnosis

Sensors (Basel). 2019 Jan 10;19(2):242. doi: 10.3390/s19020242.

Abstract

This article presents a machine learning methodology for diagnosing Parkinson's disease (PD) from vertical ground reaction force (vGRF) data collected during the gait cycle. A classification engine assigns each subject to either the healthy or the Parkinsonian class. The diagnosis process involves four steps: data pre-processing, feature extraction and selection, data classification, and performance evaluation. Features are selected through a wrapper approach built on the random forest algorithm, and the selected features serve as the inputs to each classifier. The proposed methodology uses both supervised classification methods, including K-nearest neighbour (K-NN), decision tree (DT), random forest (RF), Naïve Bayes (NB) and support vector machine (SVM), and unsupervised classification methods, such as K-means and the Gaussian mixture model (GMM). To evaluate the effectiveness of the proposed methodology, an online dataset collected in three different studies is used. This dataset contains vGRF measurements from eight force sensors placed under each foot of each subject. Ninety-three patients with Parkinson's disease and 72 healthy subjects participated in the experiments. The classifiers are compared on several metrics, including accuracy, precision, recall and F-measure, and classification performance is evaluated using leave-one-out cross-validation. The results demonstrate that the proposed methodology can accurately differentiate PD subjects from healthy subjects. For validation, the methodology is also evaluated on an additional dataset that includes subjects with other neurodegenerative diseases (amyotrophic lateral sclerosis (ALS) and Huntington's disease (HD)). The results show that the methodology discriminates PD subjects from subjects with other neurodegenerative diseases with relatively high accuracy.
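The evaluation step described above (leave-one-out cross-validation scored by accuracy, precision, recall and F-measure) can be illustrated with a minimal sketch. This is not the authors' implementation: it uses only K-NN, one of the supervised classifiers named in the abstract, and runs on synthetic data. The function names, the two placeholder features, their Gaussian parameters and the choice k = 3 are all assumptions made for illustration. Only the class sizes (93 PD, 72 healthy) come from the abstract.

```python
import math
import random


def knn_predict(train_X, train_y, x, k=3):
    """Majority vote among the k training points nearest to x (Euclidean)."""
    nearest = sorted(
        (math.dist(row, x), label) for row, label in zip(train_X, train_y)
    )[:k]
    votes = [label for _, label in nearest]
    return max(set(votes), key=votes.count)


def leave_one_out(X, y, k=3):
    """Hold out each subject once, train on the rest, predict the held-out one."""
    preds = []
    for i in range(len(X)):
        train_X = X[:i] + X[i + 1:]
        train_y = y[:i] + y[i + 1:]
        preds.append(knn_predict(train_X, train_y, X[i], k))
    return preds


def binary_metrics(y_true, y_pred, positive=1):
    """Accuracy, precision, recall and F-measure for one positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    tn = sum(t != positive and p != positive for t, p in pairs)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f_measure = (
        2 * precision * recall / (precision + recall) if precision + recall else 0.0
    )
    return {
        "accuracy": (tp + tn) / len(pairs),
        "precision": precision,
        "recall": recall,
        "f_measure": f_measure,
    }


def make_synthetic_subjects(n_pd=93, n_healthy=72, seed=0):
    """Synthetic stand-in for per-subject gait features (NOT real vGRF data).

    Each subject gets two hypothetical features drawn from class-specific
    Gaussians; label 1 = PD, label 0 = healthy.
    """
    rng = random.Random(seed)
    X, y = [], []
    for _ in range(n_pd):
        X.append([rng.gauss(1.5, 1.0), rng.gauss(1.5, 1.0)])
        y.append(1)
    for _ in range(n_healthy):
        X.append([rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)])
        y.append(0)
    return X, y


if __name__ == "__main__":
    X, y = make_synthetic_subjects()
    predictions = leave_one_out(X, y, k=3)
    for name, value in binary_metrics(y, predictions).items():
        print(f"{name}: {value:.3f}")
```

With 165 subjects, leave-one-out trains 165 times, each time on the other 164 subjects, so every subject is tested exactly once by a model that never saw it. The scores printed for the synthetic data say nothing about the results reported in the article.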

Keywords: Parkinson's disease; classification; feature selection method; gait cycle; vertical ground reaction forces (vGRFs); wearable sensors.

MeSH terms

  • Algorithms
  • Amyotrophic Lateral Sclerosis / diagnosis
  • Amyotrophic Lateral Sclerosis / physiopathology
  • Bayes Theorem
  • Diagnosis, Differential
  • Female
  • Gait / physiology*
  • Humans
  • Huntington Disease / diagnosis
  • Huntington Disease / physiopathology
  • Male
  • Normal Distribution
  • Parkinson Disease / diagnosis*
  • Parkinson Disease / physiopathology
  • Support Vector Machine