Substitution Models of Protein Evolution with Selection on Enzymatic Activity

Mol Biol Evol. 2024 Feb 1;41(2):msae026. doi: 10.1093/molbev/msae026.

Abstract

Substitution models of evolution are necessary for diverse evolutionary analyses including phylogenetic tree and ancestral sequence reconstructions. At the protein level, empirical substitution models are traditionally used due to their simplicity, but they ignore the variability of substitution patterns among protein sites. Next, in order to improve the realism of the modeling of protein evolution, a series of structurally constrained substitution models were presented, but still they usually ignore constraints on the protein activity. Here, we present a substitution model of protein evolution with selection on both protein structure and enzymatic activity, and that can be applied to phylogenetics. In particular, the model considers the binding affinity of the enzyme-substrate complex as well as structural constraints that include the flexibility of structural flaps, hydrogen bonds, amino acids backbone radius of gyration, and solvent-accessible surface area that are quantified through molecular dynamics simulations. We applied the model to the HIV-1 protease and evaluated it by phylogenetic likelihood in comparison with the best-fitting empirical substitution model and a structurally constrained substitution model that ignores the enzymatic activity. We found that accounting for selection on the protein activity improves the fitting of the modeled functional regions with the real observations, especially in data with high molecular identity, which recommends considering constraints on the protein activity in the development of substitution models of evolution.

Keywords: molecular dynamics simulations; molecular evolution; protein evolution; protein function; protein phylogenetics; substitution model.

MeSH terms

  • Amino Acid Substitution
  • Amino Acids*
  • Evolution, Molecular*
  • Models, Genetic
  • Phylogeny
  • Probability

Substances

  • Amino Acids