A More Practical Algorithm for the Rooted Triplet Distance

J Comput Biol. 2017 Feb;24(2):106-126. doi: 10.1089/cmb.2016.0185. Epub 2016 Dec 16.

Abstract

The rooted triplet distance is a measure of the dissimilarity of two phylogenetic trees with identical leaf label sets. An algorithm by Brodal et al. that computes it in [Formula: see text] time and [Formula: see text] space, where n is the number of leaf labels, has recently been implemented in the software package tqDist. In this article, we show that replacing the hierarchical decomposition tree used in Brodal et al.'s algorithm by a centroid paths-based data structure yields an [Formula: see text]-time and [Formula: see text]-space algorithm that, although slower in theory, is faster in practice as well as less memory consuming. Simulations for values of n up to 4,000,000 support our claims experimentally.

Keywords: centroid path decomposition tree; implementation; phylogenetic tree comparison; rooted triplet distance.

MeSH terms

  • Algorithms*
  • Computer Simulation
  • Models, Genetic*
  • Phylogeny*
  • Software