Methods of combinatorial optimization to reveal factors affecting gene length

Bioinform Biol Insights. 2012;6:317-27. doi: 10.4137/BBI.S10525. Epub 2012 Dec 10.

Abstract

In this paper we present a novel method for ranking genomes according to gene length. The main outcomes are the formulation of the genome ranking problem, a presentation of relevant approaches to solving it, and preliminary results from ordering prokaryotic genomes. Using a subset of prokaryotic genomes, we attempted to uncover factors affecting gene length. We demonstrate that hyperthermophilic species have shorter genes than mesophilic organisms, which suggests that environmental factors affect gene length. Moreover, these preliminary results show that environmental factors group evolutionarily distant species together in the ranking.
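
The abstract does not spell out the ranking procedure, so the following is only an illustrative sketch of what ranking genomes by gene length could look like, not the authors' method. It assumes a simple Copeland-style pairwise count: for each pair of genomes, the one whose gene is shorter in more of their shared ortholog families scores a point. The function name `rank_genomes`, the input layout, and the toy lengths are all hypothetical.

```python
from itertools import combinations


def rank_genomes(lengths):
    """Order genomes from 'shortest genes' to 'longest genes'.

    lengths: dict mapping genome name -> dict of ortholog family -> gene length.
    This is a hypothetical toy input format, not taken from the paper.
    """
    score = {genome: 0 for genome in lengths}
    for a, b in combinations(lengths, 2):
        # Compare only ortholog families present in both genomes.
        shared = lengths[a].keys() & lengths[b].keys()
        a_shorter = sum(lengths[a][f] < lengths[b][f] for f in shared)
        b_shorter = sum(lengths[b][f] < lengths[a][f] for f in shared)
        # The genome that is shorter in more shared families wins the pair.
        if a_shorter > b_shorter:
            score[a] += 1
        elif b_shorter > a_shorter:
            score[b] += 1
    # Most pairwise wins first; ties broken by name so the result is deterministic.
    return sorted(lengths, key=lambda g: (-score[g], g))


# Invented lengths (in base pairs) for three hypothetical genomes.
toy = {
    "genome_A": {"fam1": 900, "fam2": 1200, "fam3": 750},
    "genome_B": {"fam1": 870, "fam2": 1100, "fam3": 800},
    "genome_C": {"fam1": 950, "fam2": 1250},
}
ranking = rank_genomes(toy)
print(ranking)  # ['genome_B', 'genome_A', 'genome_C']
```

Comparing orthologs pair by pair, rather than averaging all gene lengths per genome, keeps the comparison restricted to genes with a shared evolutionary origin; whether the paper uses this or another formulation is not stated in the abstract.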

Keywords: adaptation; clustering; dimension-reduction techniques; evolution of prokaryotes; factor analysis; machine learning; orthologs; ranking; rating.