Machine learning to predict cardiovascular risk

Int J Clin Pract. 2019 Oct;73(10):e13389. doi: 10.1111/ijcp.13389. Epub 2019 Aug 4.

Abstract

Aims: To analyse the predictive capacity of 15 machine learning methods for estimating cardiovascular risk in a cohort and to compare them with other risk scales.

Methods: We calculated cardiovascular risk by means of 15 machine-learning methods and using the SCORE and REGICOR scales and in 38 527 patients in the Spanish ESCARVAL RISK cohort, with 5-year follow-up. We considered patients to be at high risk when the risk of a cardiovascular event was over 5% (according to SCORE and machine learning methods) or over 10% (using REGICOR). The area under the receiver operating curve (AUC) and the C-index were calculated, as well as the diagnostic accuracy rate, error rate, sensitivity, specificity, positive and negative predictive values, positive likelihood ratio, and number needed to treat to prevent a harmful outcome.

Results: The method with the greatest predictive capacity was quadratic discriminant analysis, with an AUC of 0.7086, followed by Naive Bayes and neural networks, with AUCs of 0.7084 and 0.7042, respectively. REGICOR and SCORE ranked 11th and 12th, respectively, in predictive capacity, with AUCs of 0.63. Seven machine learning methods showed a 7% higher predictive capacity (AUC) as well as higher sensitivity and specificity than the REGICOR and SCORE scales.

Conclusions: Ten of the 15 machine learning methods tested have a better predictive capacity for cardiovascular events and better classification indicators than the SCORE and REGICOR risk assessment scales commonly used in clinical practice in Spain. Machine learning methods should be considered in the development of future cardiovascular risk scales.

Publication types

  • Evaluation Study

MeSH terms

  • Algorithms*
  • Area Under Curve
  • Bayes Theorem
  • Cardiovascular Diseases / diagnosis
  • Cardiovascular Diseases / epidemiology*
  • Cardiovascular Diseases / etiology
  • Cohort Studies
  • Female
  • Follow-Up Studies
  • Humans
  • Machine Learning*
  • Male
  • Middle Aged
  • Predictive Value of Tests
  • Risk Factors
  • Spain / epidemiology