Abundance, diversity and domain architecture variability in prokaryotic DNA-binding transcription factors

PLoS One. 2018 Apr 3;13(4):e0195332. doi: 10.1371/journal.pone.0195332. eCollection 2018.

Abstract

Gene regulation at the transcriptional level is a central process in all organisms, and DNA-binding transcription factors, known as TFs, play a fundamental role. This class of proteins usually binds at specific DNA sequences, activating or repressing gene expression. In general, TFs are composed of two domains: the DNA-binding domain (DBD) and an extra domain, which in this work we have named "companion domain" (CD). This latter could be involved in one or more functions such as ligand binding, protein-protein interactions or even with enzymatic activity. In contrast to DBDs, which have been widely characterized both experimentally and bioinformatically, information on the abundance, distribution, variability and possible role of the CDs is scarce. Here, we investigated these issues associated with the domain architectures of TFs in prokaryotic genomes. To this end, 19 families of TFs in 761 non-redundant bacterial and archaeal genomes were evaluated. In this regard we found four main groups based on the abundance and distribution in the analyzed genomes: i) LysR and TetR/AcrR; ii) AraC/XylS, SinR, and others; iii) Lrp, Fis, ArsR, and others; and iv) a group that included only two families, ArgR and BirA. Based on a classification of the organisms according to the life-styles, a major abundance of regulatory families in free-living organisms, in contrast with pathogenic, extremophilic or intracellular organisms, was identified. Finally, the protein architecture diversity associated to the 19 families considering a weight score for domain promiscuity evidenced which regulatory families were characterized by either a large diversity of CDs, here named as "promiscuous" families given the elevated number of variable domains found in those TFs, or a low diversity of CDs. Altogether this information helped us to understand the diversity and distribution of the 19 Prokaryotes TF families. Moreover, initial steps were taken to comprehend the variability of the extra domain in those TFs, which eventually might assist in evolutionary and functional studies.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Archaeal Proteins / genetics*
  • Archaeal Proteins / metabolism
  • Bacterial Proteins / genetics*
  • Bacterial Proteins / metabolism
  • DNA-Binding Proteins / genetics*
  • DNA-Binding Proteins / metabolism
  • Evolution, Molecular
  • Genetic Variation
  • Genome, Archaeal
  • Genome, Bacterial
  • Protein Conformation
  • Protein Domains
  • Transcription Factors / genetics*
  • Transcription Factors / metabolism

Substances

  • Archaeal Proteins
  • Bacterial Proteins
  • DNA-Binding Proteins
  • Transcription Factors

Grants and funding

JAI was supported by grant 20180647 from Secretaria de Investigacion y Posgrado from Instituto Politecnico Nacional; EP-R and MAM-N were funded by DGAPA-Universidad Nacional Autónoma de México (IN-201117 and IA-205417, respectively). There was no additional external funding received for this study. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.