COCOA: coordinate covariation analysis of epigenetic heterogeneity

Genome Biol. 2020 Sep 7;21(1):240. doi: 10.1186/s13059-020-02139-4.

Abstract

A key challenge in epigenetics is to determine the biological significance of epigenetic variation among individuals. We present Coordinate Covariation Analysis (COCOA), a computational framework that uses covariation of epigenetic signals across individuals and a database of region sets to annotate epigenetic heterogeneity. COCOA is the first such tool for DNA methylation data and can also analyze any epigenetic signal with genomic coordinates. We demonstrate COCOA's utility by analyzing DNA methylation, ATAC-seq, and multi-omic data in supervised and unsupervised analyses, showing that COCOA provides new understanding of inter-sample epigenetic variation. COCOA is available on Bioconductor (http://bioconductor.org/packages/COCOA).
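
To make the idea of scoring region sets by covariation concrete, the following is a minimal sketch of a supervised variant. It is not COCOA's API or its exact scoring method (COCOA itself is an R/Bioconductor package). The sketch assumes that each epigenetic feature (for example, a CpG) is correlated with a sample-level target variable across individuals, and that a region set is scored by the mean absolute correlation of the features falling inside its regions. All names (`pearson`, `score_region_set`, `set_A`, `set_B`) and all values are hypothetical toy data.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length lists (0.0 if either is constant)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / sqrt(var_x * var_y)


def overlaps(coord, region):
    """True if a (chrom, position) coordinate lies in a (chrom, start, end) region."""
    chrom, start, end = region
    return coord[0] == chrom and start <= coord[1] < end


def score_region_set(coords, signal, target, regions):
    """Mean absolute feature-target correlation over features inside the region set.

    Assumption for this sketch only: the mean of absolute correlations is used
    as the region set score.
    """
    per_feature = [
        abs(pearson(signal[i], target))
        for i, coord in enumerate(coords)
        if any(overlaps(coord, region) for region in regions)
    ]
    return sum(per_feature) / len(per_feature) if per_feature else float("nan")


# Hypothetical toy data: 4 features (rows) measured in 4 samples (columns).
coords = [("chr1", 100), ("chr1", 150), ("chr2", 500), ("chr2", 560)]
signal = [
    [0.1, 0.2, 0.8, 0.9],  # tracks the target
    [0.9, 0.8, 0.2, 0.1],  # inversely tracks the target
    [0.5, 0.4, 0.5, 0.4],  # mostly unrelated to the target
    [0.3, 0.6, 0.6, 0.3],  # mostly unrelated to the target
]
target = [0.1, 0.2, 0.8, 0.9]  # one sample-level value per individual

# Hypothetical region set database: name -> list of (chrom, start, end).
region_sets = {
    "set_A": [("chr1", 50, 200)],
    "set_B": [("chr2", 400, 600)],
}

scores = {
    name: score_region_set(coords, signal, target, regions)
    for name, regions in region_sets.items()
}
ranking = sorted(scores, key=scores.get, reverse=True)
print(ranking, scores)
```

In this toy example, the region set whose features covary with the target across samples ranks above the one whose features do not. An unsupervised analysis could replace `target` with per-sample scores from a dimensionality reduction such as principal component analysis.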

Keywords: Cancer; Chromatin accessibility; DNA methylation; Data integration; Dimensionality reduction; EZH2; Epigenetics; Multi-omics; Principal component analysis.

Publication types

  • Evaluation Study
  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Breast Neoplasms / genetics
  • DNA Methylation
  • Epigenesis, Genetic*
  • Epigenomics / methods*
  • Genetic Heterogeneity*
  • Humans
  • Molecular Sequence Annotation
  • Software*