Delving into the Heterogeneity of Different Breast Cancer Subtypes and the Prognostic Models Utilizing scRNA-Seq and Bulk RNA-Seq

Int J Mol Sci. 2022 Sep 1;23(17):9936. doi: 10.3390/ijms23179936.

Abstract

Background: Breast cancer (BC) is the most common malignancy in women with high heterogeneity. The heterogeneity of cancer cells from different BC subtypes has not been thoroughly characterized and there is still no valid biomarker for predicting the prognosis of BC patients in clinical practice.

Methods: Cancer cells were identified by calculating single cell copy number variation using the inferCNV algorithm. SCENIC was utilized to infer gene regulatory networks. CellPhoneDB software was used to analyze the intercellular communications in different cell types. Survival analysis, univariate Cox, least absolute shrinkage and selection operator (LASSO) regression and multivariate Cox analysis were used to construct subtype specific prognostic models.

Results: Triple-negative breast cancer (TNBC) has a higher proportion of cancer cells than subtypes of HER2+ BC and luminal BC, and the specifically upregulated genes of the TNBC subtype are associated with antioxidant and chemical stress resistance. Key transcription factors (TFs) of tumor cells for three subtypes varied, and most of the TF-target genes are specifically upregulated in corresponding BC subtypes. The intercellular communications mediated by different receptor-ligand pairs lead to an inflammatory response with different degrees in the three BC subtypes. We establish a prognostic model containing 10 genes (risk genes: ATP6AP1, RNF139, BASP1, ESR1 and TSKU; protective genes: RPL31, PAK1, STARD10, TFPI2 and SIAH2) for luminal BC, seven genes (risk genes: ACTR6 and C2orf76; protective genes: DIO2, DCXR, NDUFA8, SULT1A2 and AQP3) for HER2+ BC, and seven genes (risk genes: HPGD, CDC42 and PGK1; protective genes: SMYD3, LMO4, FABP7 and PRKRA) for TNBC. Three prognostic models can distinguish high-risk patients from low-risk patients and accurately predict patient prognosis.

Conclusions: Comparative analysis of the three BC subtypes based on cancer cell heterogeneity in this study will be of great clinical significance for the diagnosis, prognosis and targeted therapy for BC patients.

Keywords: breast cancer; bulk RNA-seq; cancer cells; intercellular communication; prognosis; scRNA-seq.

MeSH terms

  • Actins
  • Adaptor Proteins, Signal Transducing / metabolism
  • Biomarkers, Tumor / genetics
  • Breast Neoplasms* / metabolism
  • Chromosomal Proteins, Non-Histone
  • DNA Copy Number Variations
  • Female
  • Histone-Lysine N-Methyltransferase
  • Humans
  • LIM Domain Proteins / genetics
  • Prognosis
  • RNA-Seq
  • Receptors, Cell Surface / metabolism
  • Single-Cell Analysis
  • Triple Negative Breast Neoplasms* / pathology
  • Vacuolar Proton-Translocating ATPases* / metabolism

Substances

  • ACTR6 protein, human
  • ATP6AP1 protein, human
  • Actins
  • Adaptor Proteins, Signal Transducing
  • Biomarkers, Tumor
  • Chromosomal Proteins, Non-Histone
  • LIM Domain Proteins
  • LMO4 protein, human
  • RNF139 protein, human
  • Receptors, Cell Surface
  • Histone-Lysine N-Methyltransferase
  • SMYD3 protein, human
  • Vacuolar Proton-Translocating ATPases