Integrative construction of regulatory region networks in 127 human reference epigenomes by matrix factorization

Nucleic Acids Res. 2019 Aug 22;47(14):7235-7246. doi: 10.1093/nar/gkz538.

Abstract

Despite large experimental and computational efforts aiming to dissect the mechanisms underlying disease risk, mapping cis-regulatory elements to target genes remains a challenge. Here, we introduce a matrix factorization framework to integrate physical and functional interaction data of genomic segments. The framework was used to predict a regulatory network of chromatin interaction edges linking more than 20 000 promoters and 1.8 million enhancers across 127 human reference epigenomes, including edges that are present in any of the input datasets. Our network integrates functional evidence of correlated activity patterns from epigenomic data and physical evidence of chromatin interactions. An important contribution of this work is the representation of heterogeneous data with different qualities as networks. We show that the unbiased integration of independent data sources suggestive of regulatory interactions produces meaningful associations supported by existing functional and physical evidence, correlating with expected independent biological features.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Chromatin / genetics
  • Chromatin / metabolism
  • Computational Biology / methods*
  • Epigenomics / methods*
  • Gene Expression Profiling / methods
  • Gene Ontology
  • Gene Regulatory Networks*
  • Humans
  • K562 Cells
  • Polymorphism, Single Nucleotide
  • Promoter Regions, Genetic / genetics*
  • Protein Interaction Mapping / methods
  • Regulatory Elements, Transcriptional / genetics*
  • Reproducibility of Results

Substances

  • Chromatin