A nine-gene signature to improve prognosis prediction of colon carcinoma

Cell Cycle. 2021 May;20(10):1021-1032. doi: 10.1080/15384101.2021.1919827. Epub 2021 May 14.

Abstract

This study aims to establish a gene model that can robustly and effectively predict the prognosis of colon carcinoma patients via bioinformatics. Data along with clinical information in GSE39582 Series Matrix were firstly downloaded from Gene Expression Omnibus (GEO) database. Next, differentially expressed genes (DEGs) were obtained through "edgeR" analysis. Finally, a risk predication model was established through a series of regression analyses, and then prognostic performance of the model was comprehensively evaluated though Kaplan-Meier and receiver operating characteristic (ROC) analysis. Gene set enrichment analysis (GSEA) was further performed. Totally, 846 DEGs were obtained by analyzing the gene expression data in GSE39582 dataset. A 9-gene signature-based risk predication model was established via regression analyses, and the model-based risk score was formulated as: Riskscore = (-0.1214) * TNFRSF11A + (-0.2617) * TMEM97 + (-0.1041) * LGR5 + 0.0973 * KLK10 + 0.1655 * HOXB8 + 0.227 * FKBP10 + (-0.1312) * CXCL13 + (-0.1316) * CXCL10 + 0.2593 * CD36. Kaplan-Meier curve showed that colon carcinoma patients in the high-risk group had a lower survival rate. GSEA showed that high-risk group and low-risk group displayed significant difference in biological pathways including ECM RECEPTOR INTERACTION. Besides, correlation analysis between the riskscore of the model and clinical features of patients revealed that the model could effectively predict the prognosis of patients in different ages (age>65, age<65) and stages (tumor_stage I/II, tumor_stage III/IV, T3&T4, N0&N1, N2&N3, M0). This study provides a robust model for the prognosis prediction of colon carcinoma, and lays a basis for researching the molecular mechanism underlying the development of colon carcinoma.

Keywords: Colon carcinoma; gene signature; prognosis prediction.

MeSH terms

  • Aged
  • Algorithms
  • Colonic Neoplasms / genetics*
  • Colonic Neoplasms / pathology
  • Databases, Genetic
  • Female
  • Gene Expression Profiling*
  • Gene Expression Regulation, Neoplastic*
  • Humans
  • Male
  • Middle Aged
  • Multivariate Analysis
  • Prognosis
  • Proportional Hazards Models
  • Risk Assessment
  • Survival Analysis