Pure multiple RNA secondary structure alignments: a progressive profile approach

IEEE/ACM Trans Comput Biol Bioinform. 2004 Jan-Mar;1(1):53-62. doi: 10.1109/TCBB.2004.11.

Abstract

In functional, noncoding RNA, structure is often essential to function. While the full 3D structure is very difficult to determine, the 2D structure of an RNA molecule gives good clues to its 3D structure, and for molecules of moderate length, it can be predicted with good reliability. Structure comparison is, in analogy to sequence comparison, the essential technique to infer related function. We provide a method for computing multiple alignments of RNA secondary structures under the tree alignment model, which is suitable to cluster RNA molecules purely on the structural level, i.e., sequence similarity is not required. We give a systematic generalization of the profile alignment method from strings to trees and forests. We introduce a tree profile representation of RNA secondary structure alignments which allows reasonable scoring in structure comparison. Besides the technical aspects, an RNA profile is a useful data structure to represent multiple structures of RNA sequences. Moreover, we propose a visualization of RNA consensus structures that is enriched by the full sequence information.

MeSH terms

  • 5' Untranslated Regions / chemistry
  • 5' Untranslated Regions / genetics
  • Algorithms
  • Animals
  • Base Sequence
  • Computational Biology / methods*
  • Ferritins / genetics
  • Humans
  • Mice
  • Models, Molecular*
  • Molecular Sequence Data
  • Nucleic Acid Conformation*
  • RNA / chemistry*
  • RNA / genetics
  • RNA, Transfer / chemistry
  • RNA, Transfer / genetics
  • Sequence Alignment
  • Sequence Homology, Nucleic Acid

Substances

  • 5' Untranslated Regions
  • RNA
  • Ferritins
  • RNA, Transfer