Improving accuracy of multiple sequence alignment algorithms based on alignment of neighboring residues

Nucleic Acids Res. 2009 Feb;37(2):463-72. doi: 10.1093/nar/gkn945. Epub 2008 Dec 4.

Abstract

While most of the recent improvements in multiple sequence alignment accuracy are due to better use of vertical information, which include the incorporation of consistency-based pairwise alignments and the use of profile alignments, we observe that it is possible to further improve accuracy by taking into account alignment of neighboring residues when aligning two residues, thus making better use of horizontal information. By modifying existing multiple alignment algorithms to make use of horizontal information, we show that this strategy is able to consistently improve over existing algorithms on a few sets of benchmark alignments that are commonly used to measure alignment accuracy, and the average improvements in accuracy can be as much as 1-3% on protein sequence alignment and 5-10% on DNA/RNA sequence alignment. Unlike previous algorithms, consistent average improvements can be obtained across all identity levels.

Publication types

  • Evaluation Study
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms*
  • Reproducibility of Results
  • Sequence Alignment / methods*
  • Sequence Analysis, DNA
  • Sequence Analysis, Protein
  • Sequence Analysis, RNA