Construction of a Prognosis-Related Gene Signature by Weighted Gene Coexpression Network Analysis in Ewing Sarcoma

Comput Math Methods Med. 2022 Jan 27:2022:8798624. doi: 10.1155/2022/8798624. eCollection 2022.

Abstract

Background: Ewing sarcoma (ES) is the second most common pediatric bone tumor with a high rate of metastasis, high recurrence, and low survival rate. Therefore, the identification of new biomarkers which can improve the prognosis of ES patients is urgently needed.

Methods: Here, GSE17679 dataset was downloaded from GEO databases. WGCNA method was used to identify one module associating with OVS (overall vital survival) and event. cytoHubba was used to screen out 50 hub genes from the module genes. Then, GSE17679 dataset was randomly divided into train cohort and test cohort. Next, univariate Cox analysis, LASSO regression analysis, and multivariate Cox analysis were conducted on 50 hub genes combined with train cohort data to select pivotal genes. Finally, an optimal 7-gene-based risk assessment model was established, which was verified by test cohort, entire GSE17679, and two independent datasets (GSE63157 and TCGA-SARC).

Results: The results of the functional enrichment analysis revealed that the OVS and event-associated module were mainly enriched in the protein transcription, cell proliferation, and cell-cycle control. And the train cohort was divided into high-risk and low-risk subgroups based on the median risk score; the results showed that the survival of the low-risk subgroup was significantly longer than high-risk. ROC analysis revealed that AUC values of 1, 3, and 5-year survival were 0.85, 0.94, and 0.88, and Kaplan-Meier analysis also revealed that P value < 0.0001, indicating that this model was accurate, which was also verified in the test, entire cohort, and two independent datasets (GSE63157 and TCGA-SARC). Then, we performed a comprehensive analysis (differential expression analysis, correlation analysis and survival analysis) of seven pivotal genes, and found that four genes (NCAPG, KIF4A, NUF2 and CDC20) plays a more crucial role in the prognosis of ES.

Conclusion: Taken together, this study established an optimal 7-gene-based risk assessment model and identified 4 potential therapeutic targets, to improve the prognosis of ES patients.

Publication types

  • Validation Study

MeSH terms

  • Biomarkers, Tumor / genetics
  • Bone Neoplasms / genetics*
  • Child
  • Cohort Studies
  • Computational Biology
  • Databases, Genetic / statistics & numerical data
  • Female
  • Gene Expression Profiling
  • Gene Expression Regulation, Neoplastic
  • Gene Ontology
  • Gene Regulatory Networks*
  • Humans
  • Kaplan-Meier Estimate
  • Male
  • Nomograms
  • Prognosis
  • Proportional Hazards Models
  • Protein Interaction Maps / genetics
  • Risk Assessment
  • Sarcoma, Ewing / genetics*

Substances

  • Biomarkers, Tumor