An Omega(n2/ log n) speed-up of TBR heuristics for the gene-duplication problem

Mukul S Bansal; Oliver Eulenstein

doi:10.1109/TCBB.2008.69

An Omega(n2/ log n) speed-up of TBR heuristics for the gene-duplication problem

IEEE/ACM Trans Comput Biol Bioinform. 2008 Oct-Dec;5(4):514-24. doi: 10.1109/TCBB.2008.69.

Authors

Mukul S Bansal¹, Oliver Eulenstein

Affiliation

¹ Department of Computer Science, Iowa State University, Ames, IA 50011, USA.

PMID: 18989039
DOI: 10.1109/TCBB.2008.69

Abstract

The gene-duplication problem is to infer a species supertree from gene trees that are confounded by complex histories of gene duplications. This problem is NP-hard and thus requires efficient and effective heuristics. Existing heuristics perform a stepwise search of the tree space, where each step is guided by an exact solution to an instance of a local search problem. We improve on the time complexity of the local search problem by a factor of n2= log n, where n is the size of the resulting species supertree. Typically, several thousand instances of the local search problem are solved throughout a stepwise heuristic search. Hence, our improvement makes the gene-duplication problem much more tractable for large-scale phylogenetic analyses.

Publication types

Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

Algorithms*
Base Sequence
Chromosome Mapping / methods*
Gene Duplication*
Molecular Sequence Data
Sequence Analysis, DNA / methods*