Computational and Statistical Analysis of Array-Based DNA Methylation Data

Methods Mol Biol. 2019:1878:173-191. doi: 10.1007/978-1-4939-8868-6_10.

Abstract

The characterization of aberrant DNA methylation is emerging as a key part of the study of cancer development and phenotype. Technical advances and falling costs in high-throughput DNA methylation profiling have generated strong interest in applying these methods to disease association studies. Here we discuss the principles of DNA methylation analysis using data from the Infinium DNA methylation BeadChip assays and describe the computational steps and statistical considerations involved, from processing of the raw array data to analysis of differential methylation. Moreover, we provide detailed guidelines on how to perform tumor subtype classification based on DNA methylation signatures.
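As a rough illustration of the first processing step mentioned above, the sketch below converts methylated and unmethylated probe intensities into the two summary measures commonly used for Infinium arrays: the beta value and the M-value. The abstract does not specify the chapter's own implementation. The function names, the offset of 100, and the alpha of 1 are assumptions based on widely used conventions, not values taken from this record.

```python
import math


def beta_value(meth, unmeth, offset=100.0):
    """Beta value: methylated share of total signal, in [0, 1).

    `offset` stabilises the ratio at low intensities; 100 is a commonly
    used default and is an assumption here.
    """
    m = max(meth, 0.0)    # clip negative, background-corrected intensities
    u = max(unmeth, 0.0)
    return m / (m + u + offset)


def m_value(meth, unmeth, alpha=1.0):
    """M-value: log2 ratio of methylated to unmethylated intensity.

    `alpha` avoids taking the log of zero; 1 is an assumed default.
    """
    m = max(meth, 0.0)
    u = max(unmeth, 0.0)
    return math.log2((m + alpha) / (u + alpha))


def beta_to_m(beta):
    """Logit (base 2) transform linking the two scales when offsets are ignored."""
    return math.log2(beta / (1.0 - beta))


if __name__ == "__main__":
    # Hypothetical intensities for a single probe in a single sample.
    meth, unmeth = 3000.0, 1000.0
    print("beta:", round(beta_value(meth, unmeth), 3))
    print("M-value:", round(m_value(meth, unmeth), 3))
```

Beta values are easier to interpret as an approximate methylation fraction, while M-values are often preferred as input to statistical tests of differential methylation because their variance is more uniform across the methylation range.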

Keywords: 450k array; BeadChip Assay; Cancer; Classification; DNA methylation; Epigenetics; Subtyping.

MeSH terms

  • Computational Biology / methods
  • CpG Islands / genetics
  • DNA Methylation / genetics*
  • Databases, Nucleic Acid
  • Genome, Human / genetics
  • Humans
  • Neoplasms / genetics
  • Phenotype