Predicting Clinical Outcomes in Colorectal Cancer Using Machine Learning

Stud Health Technol Inform. 2018:247:101-105.

Abstract

Using gene markers and other patient features to predict clinical outcomes plays a vital role in enhancing clinical decision making and improving prognostic accuracy. This work uses a large set of colorectal cancer patient data to train predictive models using machine learning methods such as random forest, general linear model, and neural network for clinically relevant outcomes including disease free survival, survival, radio-chemotherapy response (RCT-R) and relapse. The most successful predictive models were created for dichotomous outcomes like relapse and RCT-R with accuracies of 0.71 and 0.70 on blinded test data respectively. The best prediction models regarding overall survival and disease-free survival had C-Index scores of 0.86 and 0.76 respectively. These models could be used in the future to aid a decision for or against chemotherapy and improve survival prognosis. We propose that future work should focus on creating reusable frameworks and infrastructure for training and delivering predictive models to physicians, so that they could be readily applied to other diseases in practice and be continuously developed integrating new data.

Keywords: Machine-learning; chemotherapy; colorectal cancer; predicting clinical outcomes; relapse; survival prediction.

MeSH terms

  • Colorectal Neoplasms / mortality*
  • Disease-Free Survival
  • Humans
  • Machine Learning*
  • Neural Networks, Computer
  • Prognosis