Protein alignment: Exact versus approximate. An illustration

J Comput Chem. 2015 May 30;36(14):1069-74. doi: 10.1002/jcc.23892. Epub 2015 Mar 19.

Abstract

We illustrate solving the protein alignment problem exactly using the algorithm VESPA (very efficient search for protein alignment). We have compared our result with the approximate solution obtained with BLAST (basic local alignment search tool) software, which is currently the most widely used for searching for protein alignment. We have selected human and mouse proteins having around 170 amino acids for comparison. The exact solution has found 78 pairs of amino acids, to which one should add 17 individual amino acid alignments giving a total of 95 aligned amino acids. BLAST has identified 64 aligned amino acids which involve pairs of more than two adjacent amino acids. However, the difference between the two outputs is not as large as it may appear, because a number of amino acids that are adjacent have been reported by BLAST as single amino acids. So if one counts all amino acids, whether isolated (single) or in a group of two and more amino acids, then the count for BLAST is 89 and for VESPA is 95, a difference of only six.

Keywords: BLAST; VESPA; protein alignment; sequential amino acid matrix.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Animals
  • Databases, Factual*
  • Humans
  • Mice
  • Proteins / chemistry*
  • Sequence Alignment / methods*

Substances

  • Proteins