Accurate multiple sequence-structure alignment of RNA sequences using combinatorial optimization

BMC Bioinformatics. 2007 Jul 27:8:271. doi: 10.1186/1471-2105-8-271.

Abstract

Background: The discovery of functional non-coding RNA sequences has led to an increasing interest in algorithms related to RNA analysis. Traditional sequence alignment algorithms, however, fail at computing reliable alignments of low-homology RNA sequences. The spatial conformation of RNA sequences largely determines their function, and therefore RNA alignment algorithms have to take structural information into account.

Results: We present a graph-based representation for sequence-structure alignments, which we model as an integer linear program (ILP). We sketch how we compute an optimal or near-optimal solution to the ILP using methods from combinatorial optimization, and present results on a recently published benchmark set for RNA alignments.

Conclusion: The implementation of our algorithm yields better alignments in terms of two published scores than the other programs that we tested: This is especially the case with an increasing number of input sequences. Our program LARA is freely available for academic purposes from http://www.planet-lisa.net.

Publication types

  • Evaluation Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Computer Simulation
  • Linear Models
  • Models, Chemical
  • Models, Genetic
  • RNA / chemistry*
  • RNA / genetics*
  • Sensitivity and Specificity
  • Sequence Alignment / methods*
  • Sequence Analysis, RNA / methods*
  • Sequence Homology, Nucleic Acid

Substances

  • RNA