Systematic identification and annotation of human methylation marks based on bisulfite sequencing methylomes reveals distinct roles of cell type-specific hypomethylation in the regulation of cell identity genes

Nucleic Acids Res. 2016 Jan 8;44(1):75-94. doi: 10.1093/nar/gkv1332. Epub 2015 Dec 3.

Abstract

DNA methylation is a key epigenetic mark that is critical for gene regulation in multicellular eukaryotes. Although various human cell types may have the same genome, these cells have different methylomes. The systematic identification and characterization of methylation marks across cell types are crucial to understand the complex regulatory network for cell fate determination. In this study, we proposed an entropy-based framework termed SMART to integrate the whole genome bisulfite sequencing methylomes across 42 human tissues/cells and identified 757 887 genome segments. Nearly 75% of the segments showed uniform methylation across all cell types. From the remaining 25% of the segments, we identified cell type-specific hypo/hypermethylation marks that were specifically hypo/hypermethylated in a minority of cell types using a statistical approach and presented an atlas of the human methylation marks. Further analysis revealed that the cell type-specific hypomethylation marks were enriched through H3K27ac and transcription factor binding sites in cell type-specific manner. In particular, we observed that the cell type-specific hypomethylation marks are associated with the cell type-specific super-enhancers that drive the expression of cell identity genes. This framework provides a complementary, functional annotation of the human genome and helps to elucidate the critical features and functions of cell type-specific hypomethylation.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Binding Sites
  • Biomarkers
  • Chromatin / genetics
  • Chromatin / metabolism
  • Cluster Analysis
  • Computational Biology / methods
  • CpG Islands
  • DNA Methylation*
  • Embryonic Stem Cells / metabolism
  • Epigenesis, Genetic*
  • Epigenomics / methods*
  • Gene Expression Profiling
  • Gene Expression Regulation
  • High-Throughput Nucleotide Sequencing
  • Histones / metabolism
  • Humans
  • Nucleotide Motifs
  • Organ Specificity / genetics
  • Transcriptome*

Substances

  • Biomarkers
  • Chromatin
  • Histones