HIV Protease and Integrase Empirical Substitution Models of Evolution: Protein-Specific Models Outperform Generalist Models

Genes (Basel). 2021 Dec 27;13(1):61. doi: 10.3390/genes13010061.

Abstract

Diverse phylogenetic methods require a substitution model of evolution that should mimic, as accurately as possible, the real substitution process. At the protein level, empirical substitution models have traditionally been based on a large number of different proteins from particular taxonomic levels. However, these models assume that all of the proteins of a taxonomic level evolve under the same substitution patterns. We believe that this assumption is highly unrealistic and should be relaxed by considering protein-specific substitution models that account for protein-specific selection processes. In order to test this hypothesis, we inferred and evaluated four new empirical substitution models for the protease and integrase of HIV and other viruses. We found that these models more accurately fit, compared with any of the currently available empirical substitution models, the evolutionary process of these proteins. We conclude that evolutionary inferences from protein sequences are more accurate if they are based on protein-specific substitution models rather than taxonomic-specific (generalist) substitution models. We also present four new empirical substitution models of protein evolution that could be useful for phylogenetic inferences of viral protease and integrase.

Keywords: HIV; phylogenetic reconstruction; protein evolution; substitution model of protein evolution; viral integrase; viral protease.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Amino Acid Substitution
  • Computer Simulation
  • Evolution, Molecular*
  • HIV Infections / virology*
  • HIV Protease / genetics*
  • HIV Protease / metabolism
  • HIV-1 / enzymology
  • HIV-1 / genetics*
  • Humans
  • Integrases / genetics*
  • Integrases / metabolism
  • Models, Genetic*
  • Models, Statistical*
  • Phylogeny

Substances

  • Integrases
  • HIV Protease