Somatic Super-Enhancer Epigenetic Signature for Overall Survival Prediction in Patients with Breast Invasive Carcinoma

Bioinform Biol Insights. 2023 Mar 30:17:11779322231162767. doi: 10.1177/11779322231162767. eCollection 2023.

Abstract

To analyze genome-wide super-enhancers (SEs) methylation signature of breast invasive carcinoma (BRCA) and its clinical value. Differential methylation sites (DMS) between BRCA and adjacent tissues from The Cancer Genome Atlas (TCGA) database were identified by using ChAMP package in R software. Super-enhancers were identified sing ROSE software. Overlap analysis was used to assess the potential DMS in SEs region. Feature selection was performed by Cox regression and least absolute shrinkage and selection operator (LASSO) algorithm based on TCGA training cohort. Prognosis model validation was performed in TCGA training cohort, TCGA validation cohort, and gene expression omnibus (GEO) test cohort. The gene ontology and KEGG analysis revealed that SEs target genes were significantly enriched in cell-migration-associated processes and pathways. A total of 83 654 DMS were identified between BRCA and adjacent tissues. Around 2397 DMS in SEs region were identified by overlap study and used to feature selection. By using Cox regression and LASSO algorithm, 42 features were selected to develop a clinical prediction model (CPM). Both training (TCGA) and validation cohorts (TCGA and GEO) show that the CPM has ideal discrimination and calibration. The CPM based on DMS at SE regions has ideal discrimination and calibration, which combined with tumor node metastasis (TNM) stage could improve prognostication, and thus contribute to individualized medicine.

Keywords: H3K27ac; Super-enhancers; breast invasive carcinoma; methylation; prognosis.