GRiNCH: simultaneous smoothing and detection of topological units of genome organization from sparse chromatin contact count matrices with matrix factorization

Genome Biol. 2021 May 25;22(1):164. doi: 10.1186/s13059-021-02378-z.

Abstract

High-throughput chromosome conformation capture assays, such as Hi-C, have shown that the genome is organized into organizational units such as topologically associating domains (TADs), which can impact gene regulatory processes. The sparsity of Hi-C matrices poses a challenge for reliable detection of these units. We present GRiNCH, a constrained matrix-factorization-based approach for simultaneous smoothing and discovery of TADs from sparse contact count matrices. GRiNCH shows superior performance against seven TAD-calling methods and three smoothing methods. GRiNCH is applicable to multiple platforms including SPRITE and HiChIP and can predict novel boundary factors with potential roles in genome organization.

Keywords: High-throughput chromosomal conformation capture (Hi-C); Matrix factorization; Three-dimensional (3D) genome organization; Topologically associating domains (TADs).

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Algorithms*
  • Cell Line
  • Chromatin / genetics*
  • Chromosomes / genetics
  • Databases, Genetic
  • Genome*
  • Histones / metabolism
  • Humans
  • Protein Processing, Post-Translational
  • Regulatory Sequences, Nucleic Acid / genetics
  • Time Factors

Substances

  • Chromatin
  • Histones