A Thermodynamic Atlas of Proteomes Reveals Energetic Innovation across the Tree of Life

Mol Biol Evol. 2022 Mar 2;39(3):msac010. doi: 10.1093/molbev/msac010.

Abstract

Protein stability is a fundamental molecular property enabling organisms to adapt to their biological niches. How this is facilitated and whether there are kingdom specific or more general universal strategies are unknown. A principal obstacle to addressing this issue is that the vast majority of proteins lack annotation, specifically thermodynamic annotation, beyond the amino acid and chromosome information derived from genome sequencing. To address this gap and facilitate future investigation into large-scale patterns of protein stability and dynamics within and between organisms, we applied a unique ensemble-based thermodynamic characterization of protein folds to a substantial portion of extant sequenced genomes. Using this approach, we compiled a database resource focused on the position-specific variation in protein stability. Interrogation of the database reveals: 1) domains of life exhibit distinguishing thermodynamic features, with eukaryotes particularly different from both archaea and bacteria; 2) the optimal growth temperature of an organism is proportional to the average apolar enthalpy of its proteome; 3) intrinsic disorder content is also proportional to the apolar enthalpy (but unexpectedly not the predicted stability at 25 °C); and 4) secondary structure and global stability information of individual proteins is extractable. We hypothesize that wider access to residue-specific thermodynamic information of proteomes will result in deeper understanding of mechanisms driving functional adaptation and protein evolution. Our database is free for download at https://afc-science.github.io/thermo-env-atlas/ (last accessed January 18, 2022).

Keywords: bioinformatics; evolution; protein stability; proteome; thermodynamics.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Archaea* / genetics
  • Archaea* / metabolism
  • Bacteria / genetics
  • Eukaryota / genetics
  • Proteome* / genetics
  • Thermodynamics

Substances

  • Proteome