Solvable models of neighbor-dependent substitution processes

Math Biosci. 2008 Jan;211(1):56-88. doi: 10.1016/j.mbs.2007.10.001. Epub 2007 Oct 11.

Abstract

We prove that a wide class of Markov models of neighbor-dependent substitution processes on the integer line is solvable. This class contains some models of nucleotide substitutions recently introduced and studied empirically by molecular biologists. We show that the polynucleotide frequencies at equilibrium solve finite-size linear systems. To our knowledge, this provides the first explicit algebraic formulas for the stationary frequencies of non-degenerate neighbor-dependent models of DNA substitutions. Furthermore, we show that the dynamics of these stochastic processes and their distributions at equilibrium exhibit some stringent, rather unexpected, independence properties. For example, nucleotide sites at distance at least three evolve independently, and all the sites, when encoded as purines and pyrimidines, evolve independently.
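
As a rough illustration of what "stationary frequencies solve a finite-size linear system" means, the sketch below works through a much simpler case than the one treated in the paper. It assumes a single-site, neighbor-independent Kimura-style rate matrix; the rates 2 (transitions) and 1 (transversions), the state order A, C, G, T, and the helper names `stationary` and `to_ry` are all hypothetical choices made for this example and do not come from the article. The purine/pyrimidine encoding (A and G to R, C and T to Y) is the standard one.

```python
from fractions import Fraction

# Standard purine (R) / pyrimidine (Y) encoding of nucleotides.
RY = {"A": "R", "G": "R", "C": "Y", "T": "Y"}

def to_ry(seq):
    """Encode a DNA string as purines (R) and pyrimidines (Y)."""
    return "".join(RY[base] for base in seq)

def stationary(Q):
    """Solve pi Q = 0 with sum(pi) = 1 by exact Gauss-Jordan elimination.

    Q is a square rate matrix (rows sum to zero) of an irreducible chain.
    """
    n = len(Q)
    # pi Q = 0 is equivalent to Q^T pi^T = 0; build the augmented system.
    A = [[Fraction(Q[j][i]) for j in range(n)] + [Fraction(0)] for i in range(n)]
    # Replace the last (redundant) equation by the normalization sum(pi) = 1.
    A[n - 1] = [Fraction(1)] * n + [Fraction(1)]
    for col in range(n):
        # Find a row with a nonzero pivot and move it into place.
        pivot = next(r for r in range(col, n) if A[r][col] != 0)
        A[col], A[pivot] = A[pivot], A[col]
        A[col] = [x / A[col][col] for x in A[col]]
        # Eliminate this column from every other row.
        for r in range(n):
            if r != col and A[r][col] != 0:
                f = A[r][col]
                A[r] = [x - f * y for x, y in zip(A[r], A[col])]
    return [A[i][n] for i in range(n)]

# Toy rate matrix (assumed, state order A, C, G, T):
# transitions A<->G and C<->T at rate 2, transversions at rate 1.
Q = [
    [-4, 1, 2, 1],
    [1, -4, 1, 2],
    [2, 1, -4, 1],
    [1, 2, 1, -4],
]

pi = stationary(Q)
print(pi)                # four equal frequencies of 1/4
print(to_ry("GATTACA"))  # RRYYRYR
```

Exact rational arithmetic is used here so that the result is an algebraic answer rather than a floating-point approximation. The neighbor-dependent models of the paper involve polynucleotide frequencies and are not reproduced by this toy.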

MeSH terms

  • Algorithms
  • Base Composition
  • Base Sequence
  • Evolution, Molecular
  • Markov Chains*
  • Models, Genetic*
  • Point Mutation / genetics*
  • Poisson Distribution
  • Purine Nucleotides / genetics
  • Pyrimidine Nucleotides / genetics

Substances

  • Purine Nucleotides
  • Pyrimidine Nucleotides