SeSAMe: reducing artifactual detection of DNA methylation by Infinium BeadChips in genomic deletions

Nucleic Acids Res. 2018 Nov 16;46(20):e123. doi: 10.1093/nar/gky691.

Abstract

We report a new class of artifacts in DNA methylation measurements from Illumina HumanMethylation450 and MethylationEPIC arrays. These artifacts reflect failed hybridization to target DNA, often due to germline or somatic deletions and manifest as incorrectly reported intermediate methylation. The artifacts often survive existing preprocessing pipelines, masquerade as epigenetic alterations and can confound discoveries in epigenome-wide association studies and studies of methylation-quantitative trait loci. We implement a solution, P-value with out-of-band (OOB) array hybridization (pOOBAH), in the R package SeSAMe. Our method effectively masks deleted and hyperpolymorphic regions, reducing or eliminating spurious reports of epigenetic silencing at oft-deleted tumor suppressor genes such as CDKN2A and RB1 in cases with somatic deletions. Furthermore, our method substantially decreases technical variation whilst retaining biological variation, both within and across HM450 and EPIC platform measurements. SeSAMe provides a light-weight, modular DNA methylation data analysis suite, with a performant implementation suitable for efficient analysis of thousands of samples.

Publication types

  • Comparative Study
  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Artifacts*
  • Benchmarking
  • DNA Methylation*
  • DNA Probes
  • DNA, Neoplasm / genetics
  • Datasets as Topic
  • Gene Dosage
  • Genome
  • Germ-Line Mutation
  • High-Throughput Screening Assays / instrumentation*
  • Humans
  • Neoplasms / genetics
  • Oligonucleotide Array Sequence Analysis*
  • Polymorphism, Genetic
  • Quantitative Trait Loci
  • Reproducibility of Results
  • Sequence Deletion*
  • Software*

Substances

  • DNA Probes
  • DNA, Neoplasm