Early stage NSCLS patients' prognostic prediction with multi-information using transformer and graph neural network model

Elife. 2022 Oct 4:11:e80547. doi: 10.7554/eLife.80547.

Abstract

Background: We proposed a population graph with Transformer-generated and clinical features for the purpose of predicting overall survival (OS) and recurrence-free survival (RFS) for patients with early stage non-small cell lung carcinomas and to compare this model with traditional models.

Methods: The study included 1705 patients with lung cancer (stages I and II), and a public data set for external validation (n=127). We proposed a graph with edges representing non-imaging patient characteristics and nodes representing imaging tumour region characteristics generated by a pretrained Vision Transformer. The model was compared with a TNM model and a ResNet-Graph model. To evaluate the models' performance, the area under the receiver operator characteristic curve (ROC-AUC) was calculated for both OS and RFS prediction. The Kaplan-Meier method was used to generate prognostic and survival estimates for low- and high-risk groups, along with net reclassification improvement (NRI), integrated discrimination improvement (IDI), and decision curve analysis. An additional subanalysis was conducted to examine the relationship between clinical data and imaging features associated with risk prediction.

Results: Our model achieved AUC values of 0.785 (95% confidence interval [CI]: 0.716-0.855) and 0.695 (95% CI: 0.603-0.787) on the testing and external data sets for OS prediction, and 0.726 (95% CI: 0.653-0.800) and 0.700 (95% CI: 0.615-0.785) for RFS prediction. Additional survival analyses indicated that our model outperformed the present TNM and ResNet-Graph models in terms of net benefit for survival prediction.

Conclusions: Our Transformer-Graph model was effective at predicting survival in patients with early stage lung cancer, which was constructed using both imaging and non-imaging clinical features. Some high-risk patients were distinguishable by using a similarity score function defined by non-imaging characteristics such as age, gender, histology type, and tumour location, while Transformer-generated features demonstrated additional benefits for patients whose non-imaging characteristics were non-discriminatory for survival outcomes.

Funding: The study was supported by the National Natural Science Foundation of China (91959126, 8210071009), and Science and Technology Commission of Shanghai Municipality (20XD1403000, 21YF1438200).

Keywords: computational biology; computed tomography; lung cancer; medical imaging; medicine; none; prognostic model; survival; systems biology; transformer cnn.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • China
  • Humans
  • Lung Neoplasms* / diagnostic imaging
  • Neural Networks, Computer
  • Prognosis
  • ROC Curve

Grants and funding

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.