Scaling of Protein Function across the Tree of Life

Genome Biol Evol. 2023 Dec 1;15(12):evad214. doi: 10.1093/gbe/evad214.

Abstract

Scaling laws are a powerful way to compare genomes because they put all organisms onto a single curve and reveal nontrivial generalities as genomes change in size. The abundance of functional categories across genomes has previously been found to show power law scaling with respect to the total number of functional categories, suggesting that universal constraints shape genomic category abundance. Here, we look across the tree of life to understand how genome evolution may be related to functional scaling. We revisit previous observations of functional genome scaling with an expanded taxonomy by analyzing 3,726 bacterial, 220 archaeal, and 79 unicellular eukaryotic genomes. We find that for some functional classes, scaling is best described by multiple exponents, revealing previously unobserved shifts in scaling as genome-encoded protein annotations increase or decrease. Furthermore, we find that scaling varies between phyletic groups at both the domain and phyla levels and is less universal than previously thought. This variability in functional scaling is not related to taxonomic phylogeny resolved at the phyla level, suggesting that differences in cell plan or physiology outweigh broad patterns of taxonomic evolution. Since genomes are maintained and replicated by the functional proteins encoded by them, these results point to functional degeneracy between taxonomic groups and unique evolutionary trajectories toward these. We also find that individual phyla frequently span scaling exponents of functional classes, revealing that individual clades can move across scaling exponents. Together, our results reveal unique shifts in functions across the tree of life and highlight that as genomes grow or shrink, proteins of various functions may be added or lost.

Keywords: evolution; genome; major transitions; proteins; scaling laws.

MeSH terms

  • Bacteria / genetics
  • Genome*
  • Genomics
  • Phylogeny
  • Proteins* / genetics

Substances

  • Proteins