The Shape of Phylogenies Under Phase-Type Distributed Times to Speciation and Extinction

Bull Math Biol. 2022 Sep 14;84(10):118. doi: 10.1007/s11538-022-01072-w.

Abstract

Phylogenetic trees describe relationships between extant species, but beyond that their shape and their relative branch lengths can provide information on broader evolutionary processes of speciation and extinction. However, currently many of the most widely used macroevolutionary models make predictions about the shapes of phylogenetic trees that differ considerably from what is observed in empirical phylogenies. Here, we propose a flexible and biologically plausible macroevolutionary model for phylogenetic trees where times to speciation or extinction events are drawn from a Coxian phase-type (PH) distribution. First, we show that different choices of parameters in our model lead to a range of tree balances as measured by Aldous' β statistic. In particular, we demonstrate that it is possible to find parameters that correspond well to empirical tree balance. Next, we provide a natural extension of the β statistic to sets of trees. This extension produces less biased estimates of β compared to using the median β values from individual trees. Furthermore, we derive a likelihood expression for the probability of observing an edge-weighted tree under a model with speciation but no extinction. Finally, we illustrate the application of our model by performing both absolute and relative goodness-of-fit tests for two large empirical phylogenies (squamates and angiosperms) that compare models with Coxian PH distributed times to speciation with models that assume exponential or Weibull distributed waiting times. In our numerical analysis, we found that, in most cases, models assuming a Coxian PH distribution provided the best fit.
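
As an illustration of what a Coxian PH distributed waiting time means, the following minimal sketch draws one such time by passing through a chain of exponential phases. It is not the authors' implementation; the function name `sample_coxian` and the parameters `rates` and `advance_probs` are hypothetical, and the parameterisation (a total exit rate per phase plus a probability of advancing to the next phase rather than terminating) is one common way to write a Coxian distribution, assumed here for illustration.

```python
import random


def sample_coxian(rates, advance_probs, rng=random):
    """Draw one waiting time from a Coxian phase-type distribution.

    rates[i]         -- total exit rate of phase i (hypothetical parameter name)
    advance_probs[i] -- probability of moving from phase i to phase i + 1;
                        with the remaining probability the event (for example
                        a speciation) occurs. The last phase always ends in
                        the event, so len(advance_probs) == len(rates) - 1.
    """
    if len(advance_probs) != len(rates) - 1:
        raise ValueError("need exactly one advance probability per non-final phase")

    elapsed = 0.0
    last = len(rates) - 1
    for i, rate in enumerate(rates):
        # Exponential holding time in the current phase.
        elapsed += rng.expovariate(rate)
        # Stop in the final phase, or when the chain does not advance.
        if i == last or rng.random() >= advance_probs[i]:
            break
    return elapsed


if __name__ == "__main__":
    rng = random.Random(1)
    # Two phases; parameter values are arbitrary and for illustration only.
    draws = [sample_coxian([2.0, 1.0], [0.5], rng) for _ in range(5)]
    print(draws)
```

With a single phase (`rates=[r]`, `advance_probs=[]`) the sketch reduces to an exponential waiting time, which is the special case the abstract compares against.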

Keywords: Diversification; Macro-evolutionary model; Phase-type distribution; Tree balance.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Biological Evolution
  • Mathematical Concepts*
  • Models, Biological*
  • Phylogeny
  • Probability

Associated data

  • Dryad/10.5061/dryad.w9ghx3fpk