Estimation of an Image Biomarker for Distant Recurrence Prediction in NSCLC Using Proliferation-Related Genes

Int J Mol Sci. 2023 Feb 1;24(3):2794. doi: 10.3390/ijms24032794.

Abstract

This study aimed to identify a distant-recurrence image biomarker in NSCLC by investigating correlations between heterogeneity functional gene expression and fluorine-18-2-fluoro-2-deoxy-D-glucose positron emission tomography (18F-FDG PET) image features of NSCLC patients. RNA-sequencing data and 18F-FDG PET images of 53 patients with NSCLC (19 with distant recurrence and 34 without recurrence) from The Cancer Imaging Archive and The Cancer Genome Atlas Program databases were used in a combined analysis. Weighted correlation network analysis was performed to identify gene groups related to distant recurrence. Genes were selected for functions related to distant recurrence. In total, 47 image features were extracted from PET images as radiomics. The relationship between gene expression and image features was estimated using a hypergeometric distribution test with the Pearson correlation method. The distant recurrence prediction model was validated by a random forest (RF) algorithm using image texture features and related gene expression. In total, 37 gene modules were identified by gene-expression pattern with weighted gene co-expression network analysis. The gene modules with the highest significance were selected (p-value < 0.05). Nine genes with high protein-protein interaction and area under the curve (AUC) were identified as hub genes involved in the proliferation function, which plays an important role in distant recurrence of cancer. Four image features (GLRLM_SRHGE, GLRLM_HGRE, SUVmean, and GLZLM_GLNU) and six genes were identified to be correlated (p-value < 0.1). AUCs (accuracy: 0.59, AUC: 0.729) from the 47 image texture features and AUCs (accuracy: 0.767, AUC: 0.808) from hub genes were calculated using the RF algorithm. AUCs (accuracy: 0.783, AUC: 0.912) from the four image texture features and six correlated genes and AUCs (accuracy: 0.738, AUC: 0.779) from only the four image texture features were calculated using the RF algorithm. The four image texture features validated by heterogeneity group gene expression were found to be related to cancer heterogeneity. The identification of these image texture features demonstrated that advanced prediction of NSCLC distant recurrence is possible using the image biomarker.

Keywords: 18F-FDG PET; RNA-sequence; distant recurrence; non-small cell lung cancer; prediction model.

MeSH terms

  • Biomarkers
  • Carcinoma, Non-Small-Cell Lung* / diagnostic imaging
  • Carcinoma, Non-Small-Cell Lung* / genetics
  • Cell Proliferation
  • Fluorodeoxyglucose F18
  • Humans
  • Lung Neoplasms* / diagnostic imaging
  • Lung Neoplasms* / genetics
  • Positron Emission Tomography Computed Tomography / methods
  • Retrospective Studies

Substances

  • Fluorodeoxyglucose F18
  • Biomarkers