BOOL-AN: a method for comparative sequence analysis and phylogenetic reconstruction

Mol Phylogenet Evol. 2009 Sep;52(3):887-97. doi: 10.1016/j.ympev.2009.04.019. Epub 2009 May 5.

Abstract

A novel discrete mathematical approach is proposed as an additional tool for molecular systematics which does not require prior statistical assumptions concerning the evolutionary process. The method is based on algorithms generating mathematical representations directly from DNA/RNA or protein sequences, followed by the output of numerical (scalar or vector) and visual characteristics (graphs). The binary encoded sequence information is transformed into a compact analytical form, called the Iterative Canonical Form (or ICF) of Boolean functions, which can then be used as a generalized molecular descriptor. The method provides raw vector data for calculating different distance matrices, which in turn can be analyzed by neighbor-joining or UPGMA to derive a tree, or by principal coordinates analysis to get an ordination scattergram. The new method and the associated software for inferring phylogenetic trees are called the Boolean analysis or BOOL-AN.

Publication types

  • Research Support, Non-U.S. Gov't
  • Review

MeSH terms

  • Algorithms
  • Computational Biology / methods*
  • Models, Genetic
  • Phylogeny*
  • Sequence Analysis, DNA / methods*
  • Software*