Pan-cancer analysis of somatic mutations and epigenetic alterations in insulated neighbourhood boundaries

PLoS One. 2020 Jan 16;15(1):e0227180. doi: 10.1371/journal.pone.0227180. eCollection 2020.

Abstract

Recent evidence shows that the disruption of constitutive insulated neighbourhoods might lead to oncogene dysregulation. We present here a systematic pan-cancer characterisation of the associations between constitutive boundaries and genome alterations in cancer. Specifically, we investigate the enrichment of somatic mutation, abnormal methylation, and copy number alteration events in the proximity of CTCF bindings overlapping with topological boundaries (junctions) in 26 cancer types. Focusing on CTCF motifs that are both in-boundary (overlapping with junctions) and active (overlapping with peaks of CTCF expression), we find a significant enrichment of somatic mutations in several cancer types. Furthermore, mutated junctions are significantly conserved across cancer types, and we also observe a positive selection of transversions rather than transitions in many cancer types. We also analyzed the mutational signature found on the different classes of CTCF motifs, finding some signatures (such as SBS26) to have a higher weight within in-boundary than off-bounday motifs. Regarding methylation, we find a significant number of over-methylated active in-boundary CTCF motifs in several cancer types; similarly to somatic-mutated junctions, they also have a significant conservation across cancer types. Finally, in several cancer types we observe that copy number alterations tend to overlap with active junctions more often than in matched normal samples. While several articles have recently reported a mutational enrichment at CTCF binding sites for specific cancer types, our analysis is pan-cancer and investigates abnormal methylation and copy number alterations in addition to somatic mutations. Our method is fully replicable and suggests several follow-up tumour-specific analyses.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Motifs / genetics
  • Binding Sites / genetics
  • CCCTC-Binding Factor / genetics*
  • CCCTC-Binding Factor / metabolism*
  • Chromosomes, Human, Pair 11 / genetics
  • DNA Copy Number Variations / genetics
  • DNA Methylation
  • DNA Mutational Analysis / methods*
  • Epigenesis, Genetic / genetics*
  • Exons / genetics
  • Female
  • Gene Expression Regulation, Neoplastic / genetics
  • Genome, Human / genetics
  • Humans
  • Insulator Elements / genetics*
  • Mutation Rate
  • Neoplasms / genetics*
  • Point Mutation*
  • Promoter Regions, Genetic / genetics

Substances

  • CCCTC-Binding Factor
  • CTCF protein, human

Grants and funding

SC, PP, ES are supported by the ERC Advanced Grant 693174 GeCo (Data-Driven Genomic Computing); MRM and AN by European Union’s Horizon 2020 research and innovation programme under grant agreement No 668858.