Emphasis Learning, Features Repetition in Width Instead of Length to Improve Classification Performance: Case Study-Alzheimer's Disease Diagnosis

Sensors (Basel). 2020 Feb 10;20(3):941. doi: 10.3390/s20030941.

Abstract

In the past decade, many studies have been conducted to advance computer-aided systems for Alzheimer's disease (AD) diagnosis. Most of them have recently developed systems concentrated on extracting and combining features from MRI, PET, and CSF. For the most part, they have obtained very high performance. However, improving the performance of a classification problem is complicated, specifically when the model's accuracy or other performance measurements are higher than 90%. In this study, a novel methodology is proposed to address this problem, specifically in Alzheimer's disease diagnosis classification. This methodology is the first of its kind in the literature, based on the notion of replication on the feature space instead of the traditional sample space. Briefly, the main steps of the proposed method include extracting, embedding, and exploring the best subset of features. For feature extraction, we adopt VBM-SPM; for embedding features, a concatenation strategy is used on the features to ultimately create one feature vector for each subject. Principal component analysis is applied to extract new features, forming a low-dimensional compact space. A novel process is applied by replicating selected components, assessing the classification model, and repeating the replication until performance divergence or convergence. The proposed method aims to explore most significant features and highest-preforming model at the same time, to classify normal subjects from AD and mild cognitive impairment (MCI) patients. In each epoch, a small subset of candidate features is assessed by support vector machine (SVM) classifier. This repeating procedure is continued until the highest performance is achieved. Experimental results reveal the highest performance reported in the literature for this specific classification problem. We obtained a model with accuracies of 98.81%, 81.61%, and 81.40% for AD vs. normal control (NC), MCI vs. NC, and AD vs. MCI classification, respectively.

Keywords: Alzheimer’s disease; Emphasis Learning; PCA; SVM; multi-modal classification.

MeSH terms

  • Aged
  • Algorithms*
  • Alzheimer Disease / classification*
  • Alzheimer Disease / diagnosis*
  • Alzheimer Disease / diagnostic imaging
  • Area Under Curve
  • Cognitive Dysfunction / diagnosis
  • Female
  • Humans
  • Learning
  • Magnetic Resonance Imaging
  • Male
  • Predictive Value of Tests
  • Principal Component Analysis
  • ROC Curve
  • Support Vector Machine