Machine learning for the prediction of atherosclerotic cardiovascular disease during 3-year follow up in Chinese type 2 diabetes mellitus patients

J Diabetes Investig. 2023 Nov;14(11):1289-1302. doi: 10.1111/jdi.14069. Epub 2023 Aug 22.

Abstract

Aims/introduction: Clinical guidelines for the management of individuals with type 2 diabetes mellitus endorse the systematic assessment of atherosclerotic cardiovascular disease risk for early interventions. In this study, we aimed to develop machine learning models to predict 3-year atherosclerotic cardiovascular disease risk in Chinese type 2 diabetes mellitus patients.

Materials and methods: Clinical records of 4,722 individuals with type 2 diabetes mellitus admitted to 94 hospitals were used. The features included demographic information, disease histories, laboratory tests and physical examinations. Logistic regression, support vector machine, gradient boosting decision tree, random forest and adaptive boosting were applied for model construction. The performance of these models was evaluated using the area under the receiver operating characteristic curve. Additionally, we applied SHapley Additive exPlanation values to explain the prediction model.

Results: All five models achieved good performance in both internal and external test sets (area under the receiver operating characteristic curve >0.8). Random forest showed the highest discrimination ability, with sensitivity and specificity being 0.838 and 0.814, respectively. The SHapley Additive exPlanation analyses showed that previous history of diabetic peripheral vascular disease, older populations and longer diabetes duration were the three most influential predictors.

Conclusions: The prediction models offer opportunities to personalize treatment and maximize the benefits of these medical interventions.

Keywords: Atherosclerotic cardiovascular disease; Machine learning; Type 2 diabetes mellitus.

MeSH terms

  • Atherosclerosis* / diagnosis
  • Cardiovascular Diseases* / diagnosis
  • Cardiovascular Diseases* / epidemiology
  • Cardiovascular Diseases* / etiology
  • Diabetes Mellitus, Type 2* / complications
  • Diabetes Mellitus, Type 2* / diagnosis
  • East Asian People
  • Follow-Up Studies
  • Humans
  • Machine Learning