A combined empirical and mechanistic codon model

Mol Biol Evol. 2007 Feb;24(2):388-97. doi: 10.1093/molbev/msl175. Epub 2006 Nov 16.

Abstract

The selective forces acting on a protein are commonly inferred with codon models of evolution by contrasting the rates of synonymous and nonsynonymous substitutions. The most widely used models rest on theoretical assumptions and ignore the empirical observation that distinct amino acids differ in their replacement rates. In this paper, we develop a general method for incorporating empirical amino acid replacement probabilities into a codon-substitution matrix. The resulting codon model accounts not only for the transition-transversion bias and the nonsynonymous/synonymous ratio, but also for the different amino acid replacement probabilities specified in empirical amino acid matrices. Different empirical amino acid replacement matrices, such as secondary structure-specific matrices or organelle-specific matrices (e.g., for mitochondria and chloroplasts), can be incorporated into the model, making it context dependent. Using a diverse set of coding DNA sequences, we show that the novel model fits biological data better than either mechanistic or empirical codon models. Using the suggested model, we further analyze human immunodeficiency virus type 1 protease sequences obtained from drug-treated patients and reveal positive selection at sites known to confer drug resistance to the virus.
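To make the idea of a combined codon-substitution matrix concrete, the following is a minimal Python sketch, not the parameterization used in the paper (which the abstract does not spell out). It assumes a simple multiplicative form: a change between two sense codons that differ at exactly one nucleotide gets a rate proportional to the target codon frequency, multiplied by `kappa` for transitions and, for nonsynonymous changes, by `omega` times an amino acid exchangeability. The names `codon_rate_matrix`, `aa_exch`, and `toy_exchangeability` are hypothetical, and the toy exchangeability function is only a stand-in for a real empirical amino acid matrix.

```python
import itertools

# Standard genetic code, codons enumerated in TCAG order (third position fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODE = {"".join(c): a for c, a in zip(itertools.product(BASES, repeat=3), AMINO)}
SENSE = [c for c in CODE if CODE[c] != "*"]  # the 61 sense codons
PURINES = {"A", "G"}
HYDROPHOBIC = set("AVLIMFWC")  # illustrative grouping only (assumption)


def is_transition(x, y):
    """True for purine<->purine or pyrimidine<->pyrimidine changes (x != y)."""
    return (x in PURINES) == (y in PURINES)


def toy_exchangeability(a, b):
    """Hypothetical stand-in for an empirical amino acid matrix entry."""
    same_class = (a in HYDROPHOBIC) == (b in HYDROPHOBIC)
    return 2.0 if same_class else 0.5


def codon_rate_matrix(kappa, omega, aa_exch, codon_freqs=None):
    """Build a 61x61 instantaneous rate matrix under the assumed form above."""
    n = len(SENSE)
    pi = [1.0 / n] * n if codon_freqs is None else list(codon_freqs)
    Q = [[0.0] * n for _ in range(n)]
    for i, ci in enumerate(SENSE):
        for j, cj in enumerate(SENSE):
            diffs = [(x, y) for x, y in zip(ci, cj) if x != y]
            if len(diffs) != 1:
                continue  # only single-nucleotide changes get a nonzero rate
            rate = pi[j]
            if is_transition(*diffs[0]):
                rate *= kappa  # transition-transversion bias
            ai, aj = CODE[ci], CODE[cj]
            if ai != aj:
                # nonsynonymous: selection ratio times amino acid exchangeability
                rate *= omega * aa_exch(ai, aj)
            Q[i][j] = rate
        Q[i][i] = -sum(Q[i])  # each row of a rate matrix sums to zero
    return Q


Q = codon_rate_matrix(kappa=2.0, omega=0.5, aa_exch=toy_exchangeability)
```

Swapping `toy_exchangeability` for a lookup into a different amino acid matrix (for example, one estimated for a particular organelle or secondary structure) is what would make such a sketch context dependent in the sense described above. A real analysis would also normalize the matrix and estimate `kappa` and `omega` by maximum likelihood, which this sketch omits.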

MeSH terms

  • Amino Acid Substitution
  • Animals
  • Anti-HIV Agents / therapeutic use
  • Carbamates / therapeutic use
  • Chloroplasts / genetics
  • Codon*
  • Drug Resistance, Viral
  • Evolution, Molecular
  • Furans
  • Genes, Mitochondrial
  • Genes, Viral
  • HIV Infections / drug therapy
  • HIV Protease / chemistry
  • HIV Protease / genetics
  • HIV-1 / enzymology
  • Humans
  • Markov Chains
  • Models, Genetic*
  • Models, Statistical*
  • Mutation
  • Selection, Genetic
  • Sulfonamides / therapeutic use

Substances

  • Anti-HIV Agents
  • Carbamates
  • Codon
  • Furans
  • Sulfonamides
  • amprenavir
  • HIV Protease