Methodological aspects of whole-genome bisulfite sequencing analysis

Brief Bioinform. 2015 May;16(3):369-79. doi: 10.1093/bib/bbu016. Epub 2014 May 27.

Abstract

The combination of DNA bisulfite treatment with high-throughput sequencing technologies has enabled investigation of genome-wide DNA methylation beyond CpG sites and CpG islands. These technologies have opened new avenues to understand the interplay between epigenetic events, chromatin plasticity and gene regulation. However, the processing, managing and mining of this huge volume of data require specialized computational tools and statistical methods that are yet to be standardized. Here, we describe a complete bisulfite sequencing analysis workflow, including recently developed programs, highlighting each of the crucial analysis steps required, i.e. sequencing quality control, reads alignment, methylation scoring, methylation heterogeneity assessment, genomic features annotation, data visualization and determination of differentially methylated cytosines. Moreover, we discuss the limitations of these technologies and considerations to perform suitable analyses.

Keywords: DNA methylation; bisulfite sequencing; epigenetics; high-throughput sequencing.

Publication types

  • Research Support, Non-U.S. Gov't
  • Review

MeSH terms

  • Algorithms*
  • Base Sequence
  • Chromosome Mapping / methods*
  • CpG Islands / genetics*
  • DNA / chemistry
  • DNA / genetics*
  • DNA Methylation / genetics
  • High-Throughput Nucleotide Sequencing / methods*
  • Molecular Sequence Data
  • Sequence Analysis, DNA / methods*
  • Sulfites / chemistry

Substances

  • Sulfites
  • DNA
  • hydrogen sulfite