The Burrows-Wheeler similarity distribution between biological sequences based on Burrows-Wheeler transform

J Theor Biol. 2010 Feb 21;262(4):742-9. doi: 10.1016/j.jtbi.2009.10.033. Epub 2009 Nov 10.

Abstract

This work addresses the problem of measuring similarity between biological sequences. Based on the Burrows-Wheeler transform, the Burrows-Wheeler similarity distribution of two sequences is defined and used to compare them. Several distance measures follow naturally from this distribution. The expectation and entropy of the similarity distribution are used to construct phylogenetic trees on two independent data sets. The results demonstrate that the method is efficient and powerful.
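
For readers unfamiliar with the building blocks named in the abstract, the following is a minimal sketch of the Burrows-Wheeler transform together with the expectation and Shannon entropy of a generic discrete distribution. It does not reproduce the paper's similarity distribution, whose definition is not given in the abstract. The function names (bwt, expectation, entropy), the "$" sentinel, and the dictionary representation of a distribution are assumptions made here for illustration only.

```python
import math


def bwt(text, sentinel="$"):
    """Burrows-Wheeler transform by sorting all rotations.

    Naive O(n^2 log n) construction, for illustration only.
    The sentinel is an assumed end-of-string marker that must
    not occur in the input and sorts before the other symbols.
    """
    if sentinel in text:
        raise ValueError("sentinel must not occur in the input")
    s = text + sentinel
    rotations = sorted(s[i:] + s[:i] for i in range(len(s)))
    # The transform is the last column of the sorted rotation matrix.
    return "".join(rot[-1] for rot in rotations)


def expectation(dist):
    """Expected value of a discrete distribution {value: probability}."""
    return sum(x * p for x, p in dist.items())


def entropy(dist):
    """Shannon entropy in bits of a discrete distribution {value: probability}."""
    return -sum(p * math.log2(p) for p in dist.values() if p > 0)
```

As a usage example, bwt("banana") returns "annb$aa", and a distribution placing probability 0.5 on each of the values 0 and 1 has expectation 0.5 and entropy 1 bit. How the paper derives a distribution from the transforms of two sequences would need to be taken from the full text.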

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Animals
  • Computational Biology / methods
  • DNA, Mitochondrial / genetics
  • Drug Design
  • Entropy
  • Genetic Techniques
  • Humans
  • Membrane Proteins / chemistry
  • Models, Genetic
  • Models, Statistical
  • Mutation
  • Phylogeny
  • Protein Folding
  • Sequence Alignment / methods*
  • Sequence Analysis / methods*
  • Transferrin / genetics

Substances

  • DNA, Mitochondrial
  • Membrane Proteins
  • Transferrin