Approaches for the Analysis and Interpretation of Whole Genome Bisulfite Sequencing Data

Methods Mol Biol. 2018:1767:299-310. doi: 10.1007/978-1-4939-7774-1_17.

Abstract

DNA methylation is a covalent modification of DNA that plays important roles in processes such as the regulation of gene expression, transcription factor binding, and suppression of transposable elements. The use of whole genome bisulfite sequencing (WGBS) enables the genome-wide identification and quantification of DNA methylation patterns at single-base resolution and is the gold standard for analysis of DNA methylation. Computational analysis of WGBS data can be particularly challenging, as many computationally intensive steps are required. Here, we outline a step-by-step approach for the analysis and interpretation of WGBS data. First, sequencing reads must be trimmed, quality checked, and aligned to the genome. Second, DNA methylation levels are estimated at each cytosine position using the aligned sequence reads of the bisulfite treated DNA. Third, regions of differential cytosine methylation between samples can be identified. Finally, these data need to be visualized and interpreted in the context of the biological question at hand.

Keywords: Bioinformatics; DNA methylation; Genomics; Whole genome bisulfite sequencing.

MeSH terms

  • Animals
  • DNA / analysis
  • DNA / genetics
  • DNA Methylation*
  • Genomics / methods
  • Humans
  • Sequence Analysis, DNA / methods
  • Software*
  • Sulfites / chemistry
  • Whole Genome Sequencing / methods*

Substances

  • Sulfites
  • DNA
  • hydrogen sulfite