HERON: A Novel Tool Enables Identification of Long, Weakly Enriched Genomic Domains in ChIP-seq Data

Int J Mol Sci. 2021 Jul 29;22(15):8123. doi: 10.3390/ijms22158123.

Abstract

The explosive development of next-generation sequencing-based technologies has allowed us to take an unprecedented look at many molecular signatures of the non-coding genome. In particular, the ChIP-seq (Chromatin ImmunoPrecipitation followed by sequencing) technique is now very commonly used to assess the proteins associated with different non-coding DNA regions genome-wide. While the analysis of such data related to transcription factor binding is relatively straightforward, many modified histone variants, such as H3K27me3, are very important for the process of gene regulation but are very difficult to interpret. We propose a novel method, called HERON (HiddEn MaRkov mOdel based peak calliNg), for genome-wide data analysis that is able to detect DNA regions enriched for a certain feature, even in difficult settings of weakly enriched long DNA domains. We demonstrate the performance of our method both on simulated and experimental data.

Keywords: ChIP-seq; histone methylation; peak calling.

MeSH terms

  • Adult
  • Algorithms
  • Chromatin Immunoprecipitation Sequencing / methods*
  • DNA / genetics*
  • DNA / metabolism*
  • Gene Expression
  • Gene Expression Regulation
  • Genome, Human*
  • Hippocampus / embryology
  • Hippocampus / metabolism
  • Histone Code / genetics
  • Histones / genetics*
  • Histones / metabolism*
  • Humans
  • Liver / metabolism
  • Methylation
  • Normal Distribution
  • Protein Binding

Substances

  • Histones
  • DNA