A pooling-based approach to mapping genetic variants associated with DNA methylation

Genome Res. 2015 Jun;25(6):907-17. doi: 10.1101/gr.183749.114. Epub 2015 Apr 24.

Abstract

DNA methylation is an epigenetic modification that plays a key role in gene regulation. Previous studies have investigated its genetic basis by mapping genetic variants that are associated with DNA methylation at specific sites, but these have been limited to microarrays that cover <2% of the genome and cannot account for allele-specific methylation (ASM). Other studies have performed whole-genome bisulfite sequencing on a few individuals, but these lack statistical power to identify variants associated with DNA methylation. We present a novel approach in which bisulfite-treated DNA from many individuals is sequenced together in a single pool, resulting in a truly genome-wide map of DNA methylation. Compared to methods that do not account for ASM, our approach increases statistical power to detect associations while sharply reducing cost, effort, and experimental variability. As a proof of concept, we generated deep sequencing data from a pool of 60 human cell lines; we evaluated almost twice as many CpGs as the largest microarray studies and identified more than 2000 genetic variants associated with DNA methylation. We found that these variants are highly enriched for associations with chromatin accessibility and CTCF binding but are less likely to be associated with traits indirectly linked to DNA, such as gene expression and disease phenotypes. In summary, our approach allows genome-wide mapping of genetic variants associated with DNA methylation in any tissue of any species, without the need for individual-level genotype or methylation data.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Alleles
  • Cell Line
  • Chromosome Mapping*
  • Computational Biology
  • Computer Simulation
  • DNA Methylation*
  • Databases, Genetic
  • Epigenesis, Genetic
  • Gene Expression Regulation
  • Gene Library
  • Genetic Association Studies
  • Genome, Human
  • Genotype
  • Humans
  • Phenotype
  • Polymorphism, Single Nucleotide*
  • Quantitative Trait Loci
  • Reproducibility of Results
  • Sequence Analysis, DNA

Associated data

  • SRA/SRP045408