Machine learning‑based prediction of survival prognosis in esophageal squamous cell carcinoma

Sci Rep. 2023 Aug 19;13(1):13532. doi: 10.1038/s41598-023-40780-8.

Abstract

The current prognostic tools for esophageal squamous cell carcinoma (ESCC) lack the necessary accuracy to facilitate individualized patient management strategies. To address this issue, this study was conducted to develop a machine learning (ML) prediction model for ESCC patients' survival management. Six ML approaches, including Rpart, Elastic Net, GBM, Random Forest, GLMboost, and the machine learning-extended CoxPH method, were employed to develop risk prediction models. The model was trained on a dataset of 1954 ESCC patients with 27 clinical features and validated on a dataset of 487 ESCC patients. The discriminative performance of the models was assessed using the concordance index (C-index). The best performing model was used for risk stratification and clinical evaluation. The study found that N stage, T stage, surgical margin, tumor grade, tumor length, sex, MPV, AST, FIB, and Mg are the important feature for ESCC patients' survival. The machine learning-extended CoxPH model, Elastic Net, and Random Forest had similar performance in predicting the mortality risk of ESCC patients, and outperformed GBM, GLMboost, and Rpart. The risk scores derived from the CoxPH model effectively stratified ESCC patients into low-, intermediate-, and high-risk groups with distinctly different 3-year overall survival (OS) probabilities of 80.8%, 58.2%, and 29.5%, respectively. This risk stratification was also observed in the validation cohort. Furthermore, the risk model demonstrated greater discriminative ability and net benefit than the AJCC8th stage, suggesting its potential as a prognostic tool for predicting survival events and guiding clinical decision-making. The classical algorithm of the CoxPH method was also found to be sufficiently good for interpretive studies.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Esophageal Neoplasms*
  • Esophageal Squamous Cell Carcinoma*
  • Humans
  • Machine Learning
  • Prognosis
  • Risk Factors