Translating DNA data tables into quasi-median networks for parsimony analysis and error detection

Mol Phylogenet Evol. 2007 Jan;42(1):256-71. doi: 10.1016/j.ympev.2006.07.013. Epub 2006 Jul 26.

Abstract

Every DNA data table can be turned into a quasi-median network that faithfully represents the data. We show that, for (weighted) condensed data tables, the associated network harbors all most parsimonious reconstructions for any tree that connects the sampled haplotypes. Structural features of this network can be computed directly from the data table. The key principle repeatedly used is that the quasi-median network is uniquely determined by the sub-tables for pairs of characters. Translating a table into a network enhances the understanding of the properties of the data with regard to homoplasy and potential artifacts. The total number of nodes of such a network measures the complexity of the data. In particular, networks that display the results of filter analyses, in which hotspot mutations are removed, help to detect data idiosyncrasies and thus pinpoint sequencing problems. A pertinent example drawn from human mtDNA illustrates these points.
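
The abstract does not spell out the construction, so the following is only a minimal sketch under a stated assumption: the node set of a quasi-median network is taken to be the closure of the sampled haplotypes under the usual quasi-median operation (at each position, take the state on which the second and third sequences agree, otherwise the state of the first). The function names `quasi_median` and `quasi_median_closure` are hypothetical, the brute-force loop is only practical for toy tables, and weighting, condensing, and edge construction are not covered.

```python
from itertools import permutations


def quasi_median(u, v, w):
    """Quasi-median of three equal-length haplotypes (assumed definition).

    At each position: if v and w share a state, take it; otherwise keep
    the state of u. For binary characters this is the majority state.
    """
    return tuple(b if b == c else a for a, b, c in zip(u, v, w))


def quasi_median_closure(haplotypes):
    """Repeatedly add quasi-medians of node triples until nothing new appears.

    Brute force over all ordered triples; intended for small examples only.
    """
    nodes = {tuple(h) for h in haplotypes}
    changed = True
    while changed:
        changed = False
        # Iterate over a snapshot so the set can grow safely inside the loop.
        for u, v, w in permutations(sorted(nodes), 3):
            q = quasi_median(u, v, w)
            if q not in nodes:
                nodes.add(q)
                changed = True
    return nodes


# Toy data table: three haplotypes scored at three variable positions.
table = ["AAG", "AGA", "GAA"]
closure = quasi_median_closure(table)
print(len(closure), sorted("".join(n) for n in closure))
```

In this toy table every pair of columns is compatible with a tree, so the closure adds only the single unsampled node `AAA` that joins the three haplotypes. Under the assumption above, the size of the returned set corresponds to the node count that the abstract proposes as a measure of data complexity.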

MeSH terms

  • Animals
  • DNA / genetics*
  • DNA, Mitochondrial / genetics
  • Genetic Variation
  • Haplotypes
  • Humans
  • Models, Genetic
  • Phylogeny*
  • Polymorphism, Single Nucleotide
  • Sequence Analysis, DNA / methods*

Substances

  • DNA, Mitochondrial
  • DNA