Imbalanced learning: Improving classification of diabetic neuropathy from magnetic resonance imaging

PLoS One. 2020 Dec 15;15(12):e0243907. doi: 10.1371/journal.pone.0243907. eCollection 2020.

Abstract

One of the fundamental challenges when dealing with medical imaging datasets is class imbalance. Class imbalance happens where an instance in the class of interest is relatively low, when compared to the rest of the data. This study aims to apply oversampling strategies in an attempt to balance the classes and improve classification performance. We evaluated four different classifiers from k-nearest neighbors (k-NN), support vector machine (SVM), multilayer perceptron (MLP) and decision trees (DT) with 73 oversampling strategies. In this work, we used imbalanced learning oversampling techniques to improve classification in datasets that are distinctively sparser and clustered. This work reports the best oversampling and classifier combinations and concludes that the usage of oversampling methods always outperforms no oversampling strategies hence improving the classification results.

MeSH terms

  • Algorithms
  • Decision Trees
  • Diabetes Mellitus / classification
  • Diabetes Mellitus / diagnostic imaging*
  • Diabetes Mellitus / pathology
  • Diabetic Neuropathies / classification
  • Diabetic Neuropathies / diagnostic imaging*
  • Diabetic Neuropathies / pathology
  • Female
  • Humans
  • Machine Learning*
  • Magnetic Resonance Imaging*
  • Male
  • Neuroimaging / methods
  • Support Vector Machine

Grants and funding

The author(s) received no specific funding for this work.