Random walk and gap plots of DNA sequences

Comput Appl Biosci. 1995 Oct;11(5):503-7. doi: 10.1093/bioinformatics/11.5.503.

Abstract

Genomic sequence analysis is usually performed with the help of specialized software packages written for molecular biologists. The scope of such pre-programmed techniques is quite limited. Because DNA sequences contain a large amount of information, analysis of such sequences without underlying assumptions may provide additional insights. The present article proposes two new graphical representations as examples of such methods. The random walk plot is designed to show the base composition in a compact form, whereas the gap plot visualizes positional correlations. The random walk plot represents the DNA sequence as a curve, a random walk, in a plane. The four possible moves, left/right and up/down, are used to encode the four possible bases. Gap plots provide a tool to exhibit various features in a sequence. They visualize the periodic patterns within a sequence, either for a single type of base or between two types of bases.
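
The two representations described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation, and two details are assumptions because the abstract does not state them: the assignment of bases to directions in MOVES (A/T as left/right, G/C as up/down), and the reading of a gap plot as the number of positions where one base is followed by another base exactly k positions later. The function names random_walk and gap_counts are hypothetical.

```python
# Assumed encoding (the abstract does not say which base maps to which move):
# A = left, T = right, G = up, C = down.
MOVES = {"A": (-1, 0), "T": (1, 0), "G": (0, 1), "C": (0, -1)}


def random_walk(seq):
    """Return the (x, y) points visited by the walk, starting at the origin."""
    x, y = 0, 0
    path = [(x, y)]
    for base in seq.upper():
        # Symbols outside A/C/G/T (for example N) leave the position unchanged.
        dx, dy = MOVES.get(base, (0, 0))
        x, y = x + dx, y + dy
        path.append((x, y))
    return path


def gap_counts(seq, first, second, max_gap):
    """For each gap k in 1..max_gap, count positions i where
    seq[i] == first and seq[i + k] == second.

    Assumed interpretation of a gap plot. Use first == second for a single
    type of base, or two different bases for a correlation between types.
    Index 0 of the returned list is unused and stays 0.
    """
    seq = seq.upper()
    counts = [0] * (max_gap + 1)
    for k in range(1, max_gap + 1):
        for i in range(len(seq) - k):
            if seq[i] == first and seq[i + k] == second:
                counts[k] += 1
    return counts


if __name__ == "__main__":
    example = "ATATAT"  # toy sequence with period 2
    print(random_walk("ATGC"))
    print(gap_counts(example, "A", "A", 4))
    print(gap_counts(example, "A", "T", 3))
```

Under these assumptions, a periodic sequence shows peaks in gap_counts at multiples of its period, and the end point of random_walk reflects the excess of T over A (horizontal) and of G over C (vertical). Plotting the path or the counts with any graphics library would give pictures in the spirit of the plots described above.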

MeSH terms

  • Base Composition
  • Base Sequence
  • Biometry / methods*
  • Computer Graphics
  • DNA / chemistry
  • DNA / genetics*
  • Genetic Techniques
  • Molecular Sequence Data
  • Sequence Analysis, DNA / methods*
  • Sequence Analysis, DNA / statistics & numerical data
  • Software*

Substances

  • DNA