A new machine learning technique for an accurate diagnosis of coronary artery disease

Comput Methods Programs Biomed. 2019 Oct:179:104992. doi: 10.1016/j.cmpb.2019.104992. Epub 2019 Jul 24.

Abstract

Background and objective: Coronary artery disease (CAD) is one of the commonest diseases around the world. An early and accurate diagnosis of CAD allows a timely administration of appropriate treatment and helps to reduce the mortality. Herein, we describe an innovative machine learning methodology that enables an accurate detection of CAD and apply it to data collected from Iranian patients.

Methods: We first tested ten traditional machine learning algorithms, and then the three-best performing algorithms (three types of SVM) were used in the rest of the study. To improve the performance of these algorithms, a data preprocessing with normalization was carried out. Moreover, a genetic algorithm and particle swarm optimization, coupled with stratified 10-fold cross-validation, were used twice: for optimization of classifier parameters and for parallel selection of features.

Results: The presented approach enhanced the performance of all traditional machine learning algorithms used in this study. We also introduced a new optimization technique called N2Genetic optimizer (a new genetic training). Our experiments demonstrated that N2Genetic-nuSVM provided the accuracy of 93.08% and F1-score of 91.51% when predicting CAD outcomes among the patients included in a well-known Z-Alizadeh Sani dataset. These results are competitive and comparable to the best results in the field.

Conclusions: We showed that machine-learning techniques optimized by the proposed approach, can lead to highly accurate models intended for both clinical and research use.

Keywords: Classification; Coronary artery disease (CAD); Feature selection; Genetic algorithm; Machine learning; Normalization; Particle swarm optimization.

MeSH terms

  • Algorithms
  • Coronary Artery Disease / diagnosis*
  • Data Mining / statistics & numerical data
  • Databases, Factual / statistics & numerical data
  • Diagnosis, Computer-Assisted / statistics & numerical data
  • Female
  • Humans
  • Machine Learning* / statistics & numerical data
  • Male
  • Models, Cardiovascular
  • Support Vector Machine / statistics & numerical data