An Omega(n2/ log n) speed-up of TBR heuristics for the gene-duplication problem

IEEE/ACM Trans Comput Biol Bioinform. 2008 Oct-Dec;5(4):514-24. doi: 10.1109/TCBB.2008.69.

Abstract

The gene-duplication problem is to infer a species supertree from gene trees that are confounded by complex histories of gene duplications. This problem is NP-hard and thus requires efficient and effective heuristics. Existing heuristics perform a stepwise search of the tree space, where each step is guided by an exact solution to an instance of a local search problem. We improve on the time complexity of the local search problem by a factor of n2= log n, where n is the size of the resulting species supertree. Typically, several thousand instances of the local search problem are solved throughout a stepwise heuristic search. Hence, our improvement makes the gene-duplication problem much more tractable for large-scale phylogenetic analyses.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms*
  • Base Sequence
  • Chromosome Mapping / methods*
  • Gene Duplication*
  • Molecular Sequence Data
  • Sequence Analysis, DNA / methods*