A 3D pattern matching algorithm for DNA sequences

Bioinformatics. 2007 Mar 15;23(6):680-6. doi: 10.1093/bioinformatics/btl669. Epub 2007 Jan 18.

Abstract

Motivation: Biologists usually work with textual DNA sequences (succession of A, C, G and T). This representation allows biologists to study the syntax and other linguistic properties of DNA sequences. Nevertheless, such a linear coding offers only a local and a one-dimensional vision of the molecule. The 3D structure of DNA is known to be very important in many essential biological mechanisms. By using 3D conformation models, one is able to construct a 3D trajectory of a naked DNA molecule. From the various studies that we performed, it turned out that two very different textual DNA sequences could have similar 3D structures.

Results: In this article, we address a new research work on 3D pattern matching for DNA sequences. The aim of this work is to enhance conventional pattern matching analyses with 3D-augmented criteria. We have developed an algorithm, based on 3D trajectories, which compares angles formed by these trajectories and thus quantifies the difference between two 3D DNA sequences. This analysis performs from a global scale to al local one.

Availability: Available on request from the authors.

MeSH terms

  • Algorithms*
  • Base Sequence
  • Computer Simulation
  • DNA / chemistry*
  • DNA / ultrastructure*
  • Imaging, Three-Dimensional
  • Models, Chemical*
  • Models, Molecular*
  • Molecular Sequence Data
  • Nucleic Acid Conformation
  • Sequence Analysis, DNA / methods*

Substances

  • DNA