A model-based approach for detecting coevolving positions in a molecule

Mol Biol Evol. 2005 Sep;22(9):1919-28. doi: 10.1093/molbev/msi183. Epub 2005 Jun 8.

Abstract

We present a new method for detecting coevolving sites in molecules. The method relies on a set of aligned sequences (nucleic acid or protein) and uses Markov models of evolution to map the substitutions that occurred at each site onto the branches of the underlying phylogenetic tree. This mapping takes into account the uncertainty over ancestral states and among-site rate variation. We then build, for each site, a "substitution vector" containing the posterior estimates of the number of substitutions in each branch. The amount of coevolution for a pair of sites is then measured as the Pearson correlation coefficient between the two corresponding substitution vectors and compared to the expectation under the null hypothesis of independence. We applied the method to a 79-species bacterial ribosomal RNA data set, for which extensive structural characterization has been done over the last 30 years. More than 95% of the intramolecular predicted pairs of sites correspond to known interacting site pairs.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Base Sequence
  • Bayes Theorem*
  • Computer Simulation*
  • Escherichia coli
  • Evolution, Molecular*
  • Genetic Variation
  • Markov Chains
  • Models, Genetic*
  • Protein Conformation
  • RNA, Ribosomal*
  • Selection, Genetic

Substances

  • RNA, Ribosomal