Machine Learning Algorithm to Predict Obstructive Coronary Artery Disease: Insights from the CorLipid Trial

Eleftherios Panteris; Olga Deda; Andreas S Papazoglou; Efstratios Karagiannidis; Theodoros Liapikos; Olga Begou; Thomas Meikopoulos; Thomai Mouskeftara; Georgios Sofidis; Georgios Sianos; Georgios Theodoridis; Helen Gika

doi:10.3390/metabo12090816

Metabolites. 2022 Aug 30;12(9):816. doi: 10.3390/metabo12090816.

Authors

Eleftherios Panteris¹,², Olga Deda¹,², Andreas S Papazoglou³, Efstratios Karagiannidis³, Theodoros Liapikos⁴, Olga Begou²,⁴, Thomas Meikopoulos²,⁴, Thomai Mouskeftara¹,², Georgios Sofidis³, Georgios Sianos³, Georgios Theodoridis²,⁴, Helen Gika¹,²

Affiliations

¹ Laboratory of Forensic Medicine and Toxicology, School of Medicine, Aristotle University of Thessaloniki, 54124 Thessaloniki, Greece.
² Biomic_Auth, Bioanalysis and Omics Lab, Centre for Interdisciplinary Research of Aristotle University of Thessaloniki, 57001 Thermi, Greece.
³ First Department of Cardiology, AHEPA University Hospital, Aristotle University of Thessaloniki, St. Kiriakidi 1, 54636 Thessaloniki, Greece.
⁴ Laboratory of Analytical Chemistry, Department of Chemistry, Aristotle University of Thessaloniki, 54124 Thessaloniki, Greece.

Abstract

Developing risk assessment tools for predicting coronary artery disease (CAD) remains challenging. We developed a machine learning (ML) predictive algorithm based on metabolic and clinical data for determining the severity of CAD, as assessed via the SYNTAX score (SS). Analytical methods were developed to determine serum levels of specific ceramides, acylcarnitines, and fatty acids, and of proteins such as galectin-3 and adiponectin, as well as the APOB/APOA1 ratio. Patients were grouped into obstructive CAD (SS > 0) and non-obstructive CAD (SS = 0). A risk prediction algorithm (the boosted ensemble algorithm XGBoost) was developed by combining clinical characteristics with established and novel biomarkers to identify patients at high risk for complex CAD. The study population comprised 958 patients from the CorLipid trial (NCT04580173) with no prior CAD who underwent coronary angiography. Of them, 533 (55.6%) presented with acute coronary syndrome (ACS): 170 (17.7%) with NSTEMI, 222 (23.2%) with STEMI, and 141 (14.7%) with unstable angina. Of the total sample, 681 (71%) had obstructive CAD. The algorithm dataset comprised 73 biochemical parameters and metabolic biomarkers as well as anthropometric and medical history variables. The XGBoost algorithm achieved an AUC of 0.725 (95% CI: 0.691–0.759). Thus, an ML model incorporating clinical features in addition to certain metabolic features can estimate the pre-test likelihood of obstructive CAD.
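The grouping rule and the reported metric can be illustrated with a minimal Python sketch. It derives the binary target from the SYNTAX score (SS > 0 versus SS = 0) and computes an AUC with a percentile bootstrap confidence interval. The bootstrap is an assumption: the abstract does not state how the 95% CI was obtained. The function names, the toy SYNTAX scores, and the predicted probabilities are all hypothetical, and no XGBoost model is fitted here; the scores stand in for the output of any classifier.

```python
import random

def label_obstructive(syntax_score):
    """Binary target following the abstract's grouping: SS > 0 -> 1, SS = 0 -> 0."""
    return 1 if syntax_score > 0 else 0

def roc_auc(labels, scores):
    """AUC as the probability that a random positive case outscores a
    random negative case, with ties counted as one half."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("both classes must be present")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))

def bootstrap_auc_ci(labels, scores, n_boot=1000, alpha=0.05, seed=0):
    """Percentile bootstrap CI for the AUC (an assumed method, not the paper's)."""
    if len(set(labels)) < 2:
        raise ValueError("both classes must be present")
    rng = random.Random(seed)
    n = len(labels)
    aucs = []
    while len(aucs) < n_boot:
        idx = [rng.randrange(n) for _ in range(n)]
        ys = [labels[i] for i in idx]
        if len(set(ys)) < 2:
            continue  # skip resamples that contain a single class
        aucs.append(roc_auc(ys, [scores[i] for i in idx]))
    aucs.sort()
    lower = aucs[int((alpha / 2) * n_boot)]
    upper = aucs[min(n_boot - 1, int((1 - alpha / 2) * n_boot))]
    return lower, upper

# Hypothetical toy data: SYNTAX scores and model-predicted probabilities.
syntax_scores = [0, 0, 12.5, 0, 23, 7, 0, 31]
predicted = [0.2, 0.4, 0.7, 0.3, 0.9, 0.35, 0.6, 0.8]

labels = [label_obstructive(ss) for ss in syntax_scores]
auc = roc_auc(labels, predicted)
ci_low, ci_high = bootstrap_auc_ci(labels, predicted, n_boot=200)
print(f"AUC = {auc:.3f} (95% CI: {ci_low:.3f}-{ci_high:.3f})")
```

With only eight toy patients the interval is very wide; the sketch shows the mechanics only and does not reproduce the reported values.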

Keywords: SYNTAX score; acute coronary syndrome; acylcarnitines; atherosclerosis; biomarkers; ceramides; coronary artery disease; lipids; metabolic markers; metabolomics.

Grants and funding

This research has been co-financed by the European Regional Development Fund of the European Union and Greek national funds through the Operational Program Competitiveness, Entrepreneurship and Innovation, under the call RESEARCH-CREATE-INNOVATE (project code: T1EDK-04005).