Computational applications of DNA structural scales

Proc Int Conf Intell Syst Mol Biol. 1998:6:35-42.

Abstract

We study from a computational standpoint several different physical scales associated with structural features of DNA sequences, including dinucleotide scales such as base stacking energy and propeller twist, and trinucleotide scales such as bendability and nucleosome positioning. We show that these scales provide an alternative or complementary compact representation of DNA sequences. As an example we construct a strand invariant representation of DNA sequences. The scales can also be used to analyze and discover new DNA structural patterns, especially in combinations with hidden Markov models (HMMs). The scales are applied to HMMs of human promoter sequences revealing a number of significant differences between regions upstream and downstream of the transcriptional start point. Finally we show, with some qualifications, that such scales are by and large independent, and therefore complement each other.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Artificial Intelligence
  • Base Sequence
  • DNA / chemistry*
  • DNA / genetics
  • Humans
  • Markov Chains
  • Molecular Structure
  • Oligodeoxyribonucleotides / chemistry
  • Oligodeoxyribonucleotides / genetics
  • Pattern Recognition, Automated
  • Promoter Regions, Genetic
  • TATA Box
  • Thermodynamics

Substances

  • Oligodeoxyribonucleotides
  • DNA