Identification of a five genes prognosis signature for triple-negative breast cancer using multi-omics methods and bioinformatics analysis

Cancer Gene Ther. 2022 Nov;29(11):1578-1589. doi: 10.1038/s41417-022-00473-2. Epub 2022 Apr 26.

Abstract

Triple-negative breast cancer (TNBC) has a high degree of malignancy, lack of effective diagnosis and treatment, and poor prognosis. Bioinformatics methods are used to screen the hub genes and signal pathways involved in the progress of TNBC to provide reliable biomarkers for the diagnosis and treatment of TNBC. Download the raw data of four TNBC-related datasets from the Gene Expression Omnibus (GEO) database and use them for bioinformatics analysis. GEO2R tool was used to analyze and identify differentially expressed (DE) mRNAs. DAVID database was used to carry out gene ontology (GO) analysis and Kyoto Encyclopedia of Genes and Genome Pathways (KEGG) signal pathway enrichment analysis for DE mRNAs. STRING database and Cytoscape were used to build DE mRNAs protein-protein interaction (PPI) network diagram and visualize PPI network, respectively. Through cytoHubba, cBioPortal database, Kaplan-Meier mapper database, Gene Expression Profiling Interactive Analysis (GEPIA) Database, UALCAN Database, The Cancer Genome Atlas (TCGA) database, Tumor Immunity Estimation Resource identify hub genes. Perform qRT-PCR, Human Protein Atlas analysis, mutation analysis, survival analysis, clinical-pathological characteristics, and infiltrating immune cell analysis. 22 DE mRNAs were identified from the four datasets, including 16 upregulated DE mRNAs and six downregulated DE mRNAs. Enrichment analysis of the KEGG showed that DE mRNAs were principally enriched in pathways in cancer, mismatch repair, cell cycle, platinum drug resistance, breast cancer. Six hub genes were screened based on the PPI network diagram of DE mRNAs. Survival analysis found that TOP2A, CCNA2, PCNA, MSH2, CDK6 are related to the prognosis of TNBC. In addition, mutations, clinical indicators, and immune infiltration analysis show that these five hub genes play an important role in the progress of TNBC and immune monitoring. Compared with MCF-10A, MCF-7, and SKBR-3 cells, TOP2A, PCNA, MSH2, and CDK6 were significantly upregulated in MDA-MB-321 cells. Compared with normal, luminal, and Her-2 positive tissues, CCNA2, MSH2, and CDK6 were significantly upregulated in TNBC. Through comparative analysis of GEO datasets related to colorectal cancer and lung adenocarcinoma, it was determined that these five hub genes were unique differentially expressed genes of TNBC. At last, the hub genes related to the progression, prognosis, and immunity of TNBC have been successfully screened. They are indeed specific to TNBC as prognostic features. They can be used as potential markers for the prognosis of TNBC and provide potential therapeutic targets.

MeSH terms

  • Computational Biology
  • Gene Expression Profiling
  • Gene Expression Regulation, Neoplastic
  • Humans
  • MutS Homolog 2 Protein / genetics
  • Prognosis
  • Proliferating Cell Nuclear Antigen / genetics
  • Triple Negative Breast Neoplasms* / genetics

Substances

  • MutS Homolog 2 Protein
  • Proliferating Cell Nuclear Antigen