Validating regulatory predictions from diverse bacteria with mutant fitness data

PLoS One. 2017 May 24;12(5):e0178258. doi: 10.1371/journal.pone.0178258. eCollection 2017.

Abstract

Although transcriptional regulation is fundamental to understanding bacterial physiology, the targets of most bacterial transcription factors are not known. Comparative genomics has been used to identify likely targets of some of these transcription factors, but these predictions typically lack experimental support. Here, we used mutant fitness data, which measures the importance of each gene for a bacterium's growth across many conditions, to test regulatory predictions from RegPrecise, a curated collection of comparative genomics predictions. Because characterized transcription factors often have correlated fitness with one of their targets (either positively or negatively), correlated fitness patterns provide support for the comparative genomics predictions. At a false discovery rate of 3%, we identified significant cofitness for at least one target of 158 TFs in 107 ortholog groups and from 24 bacteria. Thus, high-throughput genetics can be used to identify a high-confidence subset of the sequence-based regulatory predictions.

Publication types

  • Validation Study

MeSH terms

  • Bacteria / genetics*
  • Bacteria / metabolism*
  • Bacterial Proteins / genetics*
  • Bacterial Proteins / metabolism*
  • Escherichia coli / genetics
  • Escherichia coli / metabolism
  • Escherichia coli Proteins / genetics
  • Escherichia coli Proteins / metabolism
  • Gene Expression Regulation, Bacterial
  • Genetic Fitness
  • High-Throughput Nucleotide Sequencing
  • Models, Genetic*
  • Mutation
  • Shewanella / genetics
  • Shewanella / metabolism
  • Transcription Factors / genetics*
  • Transcription Factors / metabolism*

Substances

  • Bacterial Proteins
  • Escherichia coli Proteins
  • Transcription Factors

Grants and funding

This material by ENIGMA- Ecosystems and Networks Integrated with Genes and Molecular Assemblies (http://enigma.lbl.gov), a Scientific Focus Area Program at Lawrence Berkeley National Laboratory is based upon work supported by the U.S. Department of Energy, Office of Science, Office of Biological & Environmental Research under contract number DE-AC02-05CH11231. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.