This work aims at the similarity of biological sequences. Based on the Burrows-Wheeler transform, a definition of Burrows-Wheeler similarity distribution of two sequences is proposed to compare two sequences. Some distance measures are naturally followed by the distribution. The expectation and entropy of the similarity distribution are used to construct phylogenetic trees on two independent data sets. The result demonstrates that the method is efficient and powerful.
(c) 2009 Elsevier Ltd. All rights reserved.