Construction and validation of an RNA-binding protein-associated prognostic model for colorectal cancer

PeerJ. 2021 Apr 5:9:e11219. doi: 10.7717/peerj.11219. eCollection 2021.

Abstract

Colorectal cancer (CRC) is one of the most prevalent and fatal malignancies, and novel biomarkers for the diagnosis and prognosis of CRC must be identified. RNA-binding proteins (RBPs) are essential modulators of transcription and translation. They are frequently dysregulated in various cancers and are related to tumorigenesis and development. The mechanisms by which RBPs regulate CRC progression are poorly understood and no clinical prognostic model using RBPs has been reported in CRC. We sought to identify the hub prognosis-related RBPs and to construct a prognostic model for clinical use. mRNA sequencing and clinical data for CRC were obtained from The Cancer Genome Atlas database (TCGA). Gene expression profiles were analyzed to identify differentially expressed RBPs using R and Perl software. Hub RBPs were filtered out using univariate Cox and multivariate Cox regression analysis. We used functional enrichment analysis, including Gene Ontology and Gene Set Enrichment Analysis, to perform the function and mechanisms of the identified RBPs. The nomogram predicted overall survival (OS). Calibration curves were used to evaluate the consistency between the predicted and actual survival rate, the consistency index (c-index) was calculated, and the prognostic effect of the model was evaluated. Finally, we identified 178 differently expressed RBPs, including 121 up-regulated and 57 down-regulated proteins. Our prognostic model was based on nine RBPs (PNLDC1, RRS1, HEXIM1, PPARGC1A, PPARGC1B, BRCA1, CELF4, AEN and NOVA1). Survival analysis showed that patients in the high-risk subgroup had a worse OS than those in the low-risk subgroup. The area under the curve value of the receiver operating characteristic curve of the prognostic model is 0.712 in the TCGA cohort and 0.638 in the GEO cohort. These results show that the model has a moderate diagnostic ability. The c-index of the nomogram is 0.77 in the TCGA cohort and 0.73 in the GEO cohort. We showed that the risk score is an independent prognostic biomarker and that some RBPs may be potential biomarkers for the diagnosis and prognosis of CRC.

Keywords: Bioinformatic analysis; Colorectal cancer; GEO; Prognostic model; RNA-binding proteins; TCGA.

Grants and funding

The authors received no funding for this work.