Genomes with distinct function composition

FEBS Lett. 1996 Jun 24;389(1):96-101. doi: 10.1016/0014-5793(96)00527-3.

Abstract

The functional composition of organisms can be analysed for the first time with the appearance of complete or sizeable parts of various genomes. We have reduced the problem of protein function classification to a simple scheme with three classes of protein function: energy-, information- and communication-associated proteins. Finer classification schemes can be easily mapped to the above three classes. To deal with the vast amount of information, a system for automatic function classification using database annotations has been developed. The system is able to classify correctly about 80% of the query sequences with annotations. Using this system, we can analyse samples from the genomes of the most represented species in sequence databases and compare their genomic composition. The similarities and differences for different taxonomic groups are strikingly intuitive. Viruses have the highest proportion of proteins involved in the control and expression of genetic information. Bacteria have the highest proportion of their genes dedicated to the production of proteins associated with small molecule transformations and transport. Animals have a very large proportion of proteins associated with intra- and intercellular communication and other regulatory processes. In general, the proportion of communication-related proteins increases during evolution, indicating trends that led to the emergence of the eukaryotic cell and later the transition from unicellular to multicellular organisms.

Publication types

  • Review

MeSH terms

  • Animals
  • Evolution, Molecular
  • Genes, Plant
  • Genome*
  • Genome, Bacterial
  • Genome, Viral
  • Humans
  • Proteins / classification
  • Proteins / genetics
  • Proteins / physiology*

Substances

  • Proteins