A novel analytical framework for risk stratification of real-world data using machine learning: A small cell lung cancer study

Clin Transl Sci. 2022 Oct;15(10):2437-2447. doi: 10.1111/cts.13371. Epub 2022 Jul 29.

Abstract

In recent studies, small cell lung cancer (SCLC) treatment guidelines based on Veterans' Administration Lung Study Group limited/extensive disease staging and resulted in broad and inseparable prognostic subgroups. Evidence suggests that the eight versions of tumor, node, and metastasis (TNM) staging can play an important role to address this issue. The aim of the present study was to improve the detection of prognostic subgroups from a real-word data (RWD) cohort of patients and analyze their patterns using a development pipeline with thoracic oncologists and machine learning methods. The method detected subgroups of patients informing unsupervised learning (partition around medoids) including the impact of covariates on prognosis (Cox regression and random survival forest). An analysis was carried out using patients with SCLC (n = 636) with stage IIIA-IVB according to TNM classification. The analysis yielded k = 7 compacted and well-separated clusters of patients. Performance status (Eastern Cooperative Oncology Group-Performance Status), lactate dehydrogenase, spreading of metastasis, cancer stage, and CRP were the baselines that characterized the subgroups. The selected clustering method outperformed standard clustering techniques, which were not capable of detecting meaningful subgroups. From the analysis of cluster treatment decisions, we showed the potential of future RWD applications to understand disease, develop individualized therapies, and improve healthcare decision making.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Humans
  • Lactate Dehydrogenases
  • Lung Neoplasms* / pathology
  • Machine Learning
  • Neoplasm Staging
  • Prognosis
  • Retrospective Studies
  • Risk Assessment
  • Small Cell Lung Carcinoma* / diagnosis
  • Small Cell Lung Carcinoma* / pathology
  • Small Cell Lung Carcinoma* / therapy

Substances

  • Lactate Dehydrogenases