Mapping DNA sequence to transcription factor binding energy in vivo

PLoS Comput Biol. 2019 Feb 4;15(2):e1006226. doi: 10.1371/journal.pcbi.1006226. eCollection 2019 Feb.

Abstract

Despite the central importance of transcriptional regulation in biology, it has proven difficult to determine the regulatory mechanisms of individual genes, let alone entire gene networks. It is particularly difficult to decipher the biophysical mechanisms of transcriptional regulation in living cells and determine the energetic properties of binding sites for transcription factors and RNA polymerase. In this work, we present a strategy for dissecting transcriptional regulatory sequences using in vivo methods (massively parallel reporter assays) to formulate quantitative models that map a transcription factor binding site's DNA sequence to transcription factor-DNA binding energy. We use these models to predict the binding energies of transcription factor binding sites to within 1 kBT of their measured values. We further explore how such a sequence-energy mapping relates to the mechanisms of trancriptional regulation in various promoter contexts. Specifically, we show that our models can be used to design specific induction responses, analyze the effects of amino acid mutations on DNA sequence preference, and determine how regulatory context affects a transcription factor's sequence specificity.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Binding Sites / genetics*
  • Chromosome Mapping
  • Computational Biology / methods*
  • DNA / chemistry
  • Gene Expression Regulation / genetics
  • Gene Regulatory Networks
  • Models, Molecular
  • Promoter Regions, Genetic / genetics
  • Protein Binding
  • Sequence Analysis, DNA / methods*
  • Transcription Factors / chemistry
  • Transcription Factors / metabolism
  • Transcription, Genetic / physiology

Substances

  • Transcription Factors
  • DNA