Multiple alignment through protein secondary-structure information

IEEE Trans Nanobioscience. 2005 Sep;4(3):207-11. doi: 10.1109/tnb.2005.853644.

Abstract

Protein secondary-structure information is known to help multiple alignment, in particular when the similarity among the sequences involved approaches the "twilight zone" (less than 30% pairwise similarity). This paper presents a multiple alignment algorithm explicitly designed to exploit any available secondary-structure information. A layered architecture with two interacting levels handles both the primary- and secondary-structure information of the target sequences. Secondary structure (either available or predicted by a technique based on multiple experts) is used to compute an initial alignment at the secondary level, which is then refined at the primary level by locally scoped operators. To evaluate the impact of secondary-structure information on alignment quality, in particular for alignments with a low degree of similarity, the technique has been implemented and assessed on relevant test cases.
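
As a rough illustration of the first level only, the sketch below aligns two sequences by their secondary-structure strings and then projects the resulting gaps onto the residues. It is not the paper's algorithm: it is pairwise rather than multiple, it uses a plain Needleman-Wunsch recurrence, and it omits the primary-level refinement operators entirely. The function names, the three-state H/E/C alphabet, and the scoring values (`match`, `mismatch`, `gap`) are all assumptions made for this example.

```python
def align_secondary(ss_a, ss_b, match=2, mismatch=-1, gap=-2):
    """Globally align two secondary-structure strings (e.g. 'HHHEEC').

    Returns a list of (i, j) index pairs; None marks a gap on that side.
    Scoring values are illustrative assumptions, not taken from the paper.
    """
    n, m = len(ss_a), len(ss_b)
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            s = match if ss_a[i - 1] == ss_b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + s,
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)

    # Trace back from the bottom-right corner to recover one optimal path.
    pairs = []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0:
            s = match if ss_a[i - 1] == ss_b[j - 1] else mismatch
            if score[i][j] == score[i - 1][j - 1] + s:
                pairs.append((i - 1, j - 1))
                i, j = i - 1, j - 1
                continue
        if i > 0 and score[i][j] == score[i - 1][j] + gap:
            pairs.append((i - 1, None))
            i -= 1
        else:
            pairs.append((None, j - 1))
            j -= 1
    pairs.reverse()
    return pairs


def project_to_primary(pairs, seq_a, seq_b):
    """Carry the secondary-level gap pattern over to the residue sequences."""
    row_a = "".join(seq_a[i] if i is not None else "-" for i, _ in pairs)
    row_b = "".join(seq_b[j] if j is not None else "-" for _, j in pairs)
    return row_a, row_b


def percent_identity(row_a, row_b):
    """Percentage of identical residues over columns without gaps."""
    cols = [(a, b) for a, b in zip(row_a, row_b) if a != "-" and b != "-"]
    if not cols:
        return 0.0
    return 100.0 * sum(a == b for a, b in cols) / len(cols)
```

Each secondary-structure string must have the same length as its residue sequence. Calling `align_secondary` on the two structure strings, passing the result to `project_to_primary`, and then applying `percent_identity` to the two rows gives a quick check of whether a pair falls below the 30% threshold mentioned above.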

Publication types

  • Evaluation Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Amino Acid Sequence
  • Molecular Sequence Data
  • Protein Conformation
  • Protein Structure, Secondary
  • Proteins / analysis*
  • Proteins / chemistry*
  • Sequence Alignment / methods
  • Sequence Analysis, Protein / methods*
  • Sequence Homology, Amino Acid

Substances

  • Proteins