Machine learning was used to predict risk factors for distant metastasis of pancreatic cancer and prognosis analysis

J Cancer Res Clin Oncol. 2023 Sep;149(12):10279-10291. doi: 10.1007/s00432-023-04903-y. Epub 2023 Jun 6.

Abstract

Background: The mechanisms of distant metastasis in pancreatic cancer (PC) have not been elucidated, and this study aimed to explore the risk factors affecting the metastasis and prognosis of metastatic patients and to develop a predictive model.

Method: Clinical data from patients meeting criteria from 1990 to 2019 were obtained from the Surveillance, Epidemiology, and End Results (SEER) database, and two machine learning methods, random forest and support vector machine, combined with logistic regression, were used to explore risk factors influencing distant metastasis and to create nomograms. The performance of the model was validated using calibration curves and ROC curves based on the Shaanxi Provincial People's Hospital cohort. LASSO regression and Cox regression models were used to explore the independent risk factors affecting the prognosis of patients with distant PC metastases.

Results: We found that independent risk factors affecting PC distant metastasis were: age, radiotherapy, chemotherapy, T and N; the independent risk factors for patient prognosis were: age, grade, bone metastasis, brain metastasis, lung metastasis, radiotherapy and chemotherapy.

Conclusion: Together, our study provides a method for risk factors and prognostic assessment for patients with distant PC metastases. The nomogram we developed can be used as a convenient individualized tool to facilitate aid in clinical decision making.

Keywords: Machine learning; Pancreatic cancer; The Surveillance, Epidemiology, and End Results Program (SEER); Tumor metastasis; Tumor prognosis.

MeSH terms

  • Humans
  • Machine Learning
  • Nomograms*
  • Pancreatic Neoplasms*
  • Prognosis
  • SEER Program