Pre-existing and machine learning-based models for cardiovascular risk prediction

Sang-Yeong Cho; Sun-Hwa Kim; Si-Hyuck Kang; Kyong Joon Lee; Dongjun Choi; Seungjin Kang; Sang Jun Park; Tackeun Kim; Chang-Hwan Yoon; Tae-Jin Youn; In-Ho Chae

doi:10.1038/s41598-021-88257-w

Pre-existing and machine learning-based models for cardiovascular risk prediction

Sci Rep. 2021 Apr 26;11(1):8886. doi: 10.1038/s41598-021-88257-w.

Authors

Sang-Yeong Cho¹, Sun-Hwa Kim², Si-Hyuck Kang^{3

4}, Kyong Joon Lee⁵, Dongjun Choi⁵, Seungjin Kang⁶, Sang Jun Park⁷, Tackeun Kim⁸, Chang-Hwan Yoon^{2

9}, Tae-Jin Youn^{2

9}, In-Ho Chae^{2

9}

Affiliations

¹ Department of Cardiology, Gyeongsang National University School of Medicine and Gyeongsang National University Changwon Hospital, Changwon, Korea.
² Cardiovascular Center, Internal Medicine, Seoul National University Bundang Hospital, 82, Gumi-Ro 173 Beon-Gil, Bundang-Gu, Seongnam-si, 13620, Gyeonggi-Do, Korea.
³ Cardiovascular Center, Internal Medicine, Seoul National University Bundang Hospital, 82, Gumi-Ro 173 Beon-Gil, Bundang-Gu, Seongnam-si, 13620, Gyeonggi-Do, Korea. eandp303@snu.ac.kr.
⁴ Department of Internal Medicine, Seoul National University, Seoul, Korea. eandp303@snu.ac.kr.
⁵ Department of Radiology, Seoul National University Bundang Hospital, Seoul National University College of Medicine, Seongnam-si, Korea.
⁶ Office of eHealth Research and Businesses, Seoul National University Bundang Hospital, Seongnam-si, Korea.
⁷ Department of Ophthalmology, Seoul National University Bundang Hospital, Seoul National University College of Medicine, Seongnam-si, Korea.
⁸ Department of Neurosurgery, Seoul National University Bundang Hospital, Seoul National University College of Medicine, Seongnam-si, Korea.
⁹ Department of Internal Medicine, Seoul National University, Seoul, Korea.

Abstract

Predicting the risk of cardiovascular disease is the key to primary prevention. Machine learning has attracted attention in analyzing increasingly large, complex healthcare data. We assessed discrimination and calibration of pre-existing cardiovascular risk prediction models and developed machine learning-based prediction algorithms. This study included 222,998 Korean adults aged 40-79 years, naïve to lipid-lowering therapy, had no history of cardiovascular disease. Pre-existing models showed moderate to good discrimination in predicting future cardiovascular events (C-statistics 0.70-0.80). Pooled cohort equation (PCE) specifically showed C-statistics of 0.738. Among other machine learning models such as logistic regression, treebag, random forest, and adaboost, the neural network model showed the greatest C-statistic (0.751), which was significantly higher than that for PCE. It also showed improved agreement between the predicted risk and observed outcomes (Hosmer-Lemeshow χ² = 86.1, P < 0.001) than PCE for whites did (Hosmer-Lemeshow χ² = 171.1, P < 0.001). Similar improvements were observed for Framingham risk score, systematic coronary risk evaluation, and QRISK3. This study demonstrated that machine learning-based algorithms could improve performance in cardiovascular risk prediction over contemporary cardiovascular risk models in statin-naïve healthy Korean adults without cardiovascular disease. The model can be easily adopted for risk assessment and clinical decision making.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Adult
Aged
Cardiovascular Diseases / diagnosis*
Female
Humans
Machine Learning*
Male
Middle Aged
Models, Cardiovascular*
Risk Assessment
Risk Factors