Prokaryote phylogeny without sequence alignment: from avoidance signature to composition distance

Proc IEEE Comput Soc Bioinform Conf. 2003:2:375-84.

Abstract

A new and essentially simple method to reconstruct prokaryotic phylogenetic trees from their complete genome data without using sequence alignment is proposed. It is based on the appearance frequency of oligopeptides of a fixed length (up to K = 6) in their proteomes. This is a method without fine adjustment and choice of genes. It can incorporate the effect of lateral gene transfer to some extent and leads to results comparable with the bacteriologists' systematics as reflected in the latest 2001 edition of the Bergey's Manual of Systematic Bacteriology [1, 2]. A key point in our approach is subtraction of a random background by using a Markovian model of order K - 1 from the composition vectors to highlight the shaping role of natural selection.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Base Composition
  • Base Sequence
  • Computer Simulation
  • DNA Mutational Analysis / methods*
  • DNA, Bacterial / genetics*
  • Models, Genetic*
  • Molecular Sequence Data
  • Phylogeny*
  • Recombination, Genetic / genetics*
  • Sequence Alignment
  • Sequence Analysis, DNA / methods*
  • Signal Transduction / genetics*

Substances

  • DNA, Bacterial