Enhancer Predictions and Genome-Wide Regulatory Circuits

Annu Rev Genomics Hum Genet. 2020 Aug 31:21:37-54. doi: 10.1146/annurev-genom-121719-010946. Epub 2020 May 22.

Abstract

Spatiotemporal control of gene expression during development requires orchestrated activities of numerous enhancers, which are cis-regulatory DNA sequences that, when bound by transcription factors, support selective activation or repression of associated genes. Proper activation of enhancers is critical during embryonic development, adult tissue homeostasis, and regeneration, and inappropriate enhancer activity is often associated with pathological conditions such as cancer. Multiple consortia [e.g., the Encyclopedia of DNA Elements (ENCODE) Consortium and National Institutes of Health Roadmap Epigenomics Mapping Consortium] and independent investigators have mapped putative regulatory regions in a large number of cell types and tissues, but the sequence determinants of cell-specific enhancers are not yet fully understood. Machine learning approaches trained on large sets of these regulatory regions can identify core transcription factor binding sites and generate quantitative predictions of enhancer activity and the impact of sequence variants on activity. Here, we review these computational methods in the context of enhancer prediction and gene regulatory network models specifying cell fate.

Keywords: cell fate switching; enhancers; gene regulatory networks; machine learning; sequence-based prediction.

Publication types

  • Research Support, N.I.H., Extramural
  • Review

MeSH terms

  • Computational Biology / methods*
  • Enhancer Elements, Genetic*
  • Gene Regulatory Networks*
  • Genome, Human*
  • Humans