Gene Tree Discord, Simplex Plots, and Statistical Tests under the Coalescent

Syst Biol. 2022 Jun 16;71(4):929-942. doi: 10.1093/sysbio/syab008.

Abstract

A simple graphical device, the simplex plot of quartet concordance factors, is introduced to aid in the exploration of a collection of gene trees on a common set of taxa. A single plot summarizes all gene tree discord and allows for visual comparison to the expected discord from the multispecies coalescent model (MSC) of incomplete lineage sorting on a species tree. A formal statistical procedure is described that can quantify the deviation from expectation for each subset of four taxa, suggesting when the data are not in accord with the MSC, and thus that either gene tree inference error is substantial or a more complex model such as that on a network may be required. If the collection of gene trees is in accord with the MSC, the plots reveal when substantial incomplete lineage sorting is present. Applications to both simulated and empirical multilocus data sets illustrate the insights provided. [Gene tree discordance; hypothesis test; multispecies coalescent model; quartet concordance factor; simplex plot; species tree].
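As a rough illustration of the quantities the abstract refers to, the sketch below computes the expected quartet concordance factors under the MSC and maps a concordance-factor triple to a point in a 2D simplex (triangle) plot. It relies on the standard MSC result for a four-taxon species tree with internal branch length t in coalescent units: the topology matching the species tree has probability 1 - (2/3)e^{-t}, and each of the two alternatives has probability (1/3)e^{-t}. The function names, the ordering of the three topologies, and the triangle orientation are assumptions made for this example. This is not the authors' implementation, and the formal hypothesis test described in the abstract is not reproduced here.

```python
import math

def expected_quartet_cfs(t):
    """Expected concordance factors for one quartet under the MSC.

    t is the internal branch length of the four-taxon species tree in
    coalescent units. The first entry is the topology matching the
    species tree; the other two are the discordant topologies.
    """
    if t < 0:
        raise ValueError("branch length must be non-negative")
    minor = math.exp(-t) / 3.0
    return (1.0 - 2.0 * minor, minor, minor)

def empirical_cfs(counts):
    """Normalize counts of the three quartet topologies to proportions."""
    total = sum(counts)
    if total <= 0:
        raise ValueError("need at least one gene tree")
    return tuple(c / total for c in counts)

# Corners of an equilateral triangle, one per quartet topology.
# The orientation is an arbitrary choice made for this sketch.
VERTICES = ((0.0, 0.0), (1.0, 0.0), (0.5, math.sqrt(3) / 2))

def simplex_xy(cf):
    """Map a concordance-factor triple (summing to 1) to 2D coordinates."""
    x = sum(p * v[0] for p, v in zip(cf, VERTICES))
    y = sum(p * v[1] for p, v in zip(cf, VERTICES))
    return (x, y)
```

For example, `simplex_xy(empirical_cfs((60, 22, 18)))` gives the plotted position of a quartet whose three topologies appear in 60, 22, and 18 gene trees (made-up counts). Under the MSC on a species tree, the expected points from `expected_quartet_cfs` have equal second and third entries, so they lie on a line segment running from the centroid of the triangle (t = 0) toward one vertex (large t). Empirical points far from the three such segments are the ones a formal test would flag.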

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Computer Simulation
  • Genetic Speciation*
  • Models, Genetic*
  • Phylogeny

Associated data

  • Dryad/10.5061/dryad.34tmpg4hq