Decision tree-based diagnosis of coronary artery disease: CART model

Comput Methods Programs Biomed. 2020 Aug:192:105400. doi: 10.1016/j.cmpb.2020.105400. Epub 2020 Feb 19.

Abstract

Background and objective: As the most common cardiovascular defect, coronary artery disease (CAD), also called ischemic heart disease, is one of the substantial causes of death globally. Several diagnosis approaches such as baseline electrocardiography, echocardiography, magnetic resonance imaging, and coronary angiography are suggested for screening the suspected patients that may suffer from CAD. However, applying such methods may have health side effects and/or expensive costs.

Methods: As an alternative to the available diagnosis tools/methods, this research involves a decision tree learning algorithm called classification and regression tree (CART) for a simple and reliable diagnosis of CAD. Several CART models are developed based on the recently CAD dataset published in the literature.

Results: Utilizing all the features of the dataset (55 independent parameters), it was found that only 40 independent parameters influence the CAD diagnosis and consequently development of the predictive model. Based on the feature importance obtained from the first CART model, three new CART models are then developed using 18, 10, and 5 selected features. Except for the five-feature CART model, the outcomes of developed CART models demonstrate the maximum achievable accuracy, sensitivity, and specificity for CAD diagnosis (100%), while comparing the predictions with the reported targets. The error analysis reveals that the literature models including sequential minimal optimization (SMO), bagging SMO, Naïve Bayes (NB), artificial neural network (ANN), C4.5, J48, Bagging, and ANN in conjunction with the genetic algorithm (GA) do not outperform the CART methodology in classifying patients as normal or CAD.

Conclusions: Hence, the robustness of the tree-based algorithm in accurate and fast predictions is confirmed, implying the proposed classification technique can be successfully utilized to develop a coherent decision-making system for the CAD diagnosis.

Keywords: Classification; Classification and regression tree; Coronary artery disease; Decision tree; Error analysis.

MeSH terms

  • Adult
  • Aged
  • Aged, 80 and over
  • Algorithms*
  • Coronary Artery Disease / diagnosis*
  • Decision Trees*
  • Female
  • Humans
  • Male
  • Middle Aged
  • Neural Networks, Computer
  • Regression Analysis
  • Young Adult