Lengths of Orthologous Prokaryotic Proteins Are Affected by Evolutionary Factors

Biomed Res Int. 2015:2015:786861. doi: 10.1155/2015/786861. Epub 2015 May 31.

Abstract

Proteins of the same functional family (for example, kinases) may have significantly different lengths. It is an open question whether such variation in length is random or it appears as a response to some unknown evolutionary driving factors. The main purpose of this paper is to demonstrate existence of factors affecting prokaryotic gene lengths. We believe that the ranking of genomes according to lengths of their genes, followed by the calculation of coefficients of association between genome rank and genome property, is a reasonable approach in revealing such evolutionary driving factors. As we demonstrated earlier, our chosen approach, Bubble-sort, combines stability, accuracy, and computational efficiency as compared to other ranking methods. Application of Bubble Sort to the set of 1390 prokaryotic genomes confirmed that genes of Archaeal species are generally shorter than Bacterial ones. We observed that gene lengths are affected by various factors: within each domain, different phyla have preferences for short or long genes; thermophiles tend to have shorter genes than the soil-dwellers; halophiles tend to have longer genes. We also found that species with overrepresentation of cytosines and guanines in the third position of the codon (GC3 content) tend to have longer genes than species with low GC3 content.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Archaea / genetics*
  • Archaeal Proteins / genetics*
  • Bacterial Proteins / genetics*
  • Codon
  • Evolution, Molecular*
  • Genome, Archaeal
  • Genome, Bacterial

Substances

  • Archaeal Proteins
  • Bacterial Proteins
  • Codon