Using multiple relaxed-clock models to estimate evolutionary timescales from DNA sequence data

Mol Phylogenet Evol. 2014 Aug:77:65-70. doi: 10.1016/j.ympev.2014.04.010. Epub 2014 Apr 18.

Abstract

Molecular data sets comprising DNA sequences from multiple genes are now commonplace in phylogenetic studies. These data sets should be analysed using methods that can account for heterogeneity in the molecular evolutionary process among genes. A common problem is determining how many evolutionary models should be applied to subsets of the data. Different genes can exhibit differing patterns of rate variation among lineages, making it appropriate to assign a separate molecular-clock model to each subset of the data (i.e., a 'partitioned' clock model). The impact of clock-partitioning on estimates of evolutionary rates and timescales is largely unknown. In this study, we use a recently developed method, ClockstaR, to evaluate the effect of using different clock-partitioning schemes. We conduct Bayesian phylogenetic analyses of simulated and empirical multigene data sets. Our analyses show strong statistical support for the clock-partitioning scheme chosen by ClockstaR. In addition, we find that the optimal clock-partitioning scheme produces more reliable estimates of node ages than other schemes.

Keywords: Bayesian model selection; Divergence times; Molecular clock; Relaxed clock.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Bayes Theorem
  • Evolution, Molecular*
  • Models, Genetic*
  • Sequence Analysis, DNA