Databases, models, and algorithms for functional genomics: a bioinformatics perspective

Mol Biotechnol. 2005 Feb;29(2):165-83. doi: 10.1385/MB:29:2:165.

Abstract

A variety of patterns have been observed on the DNA and protein sequences that serve as control points for gene expression and cellular functions. Owing to the vital role of such patterns discovered on biological sequences, they are generally cataloged and maintained within internationally shared databases. Furthermore,the variability in a family of observed patterns is often represented using computational models in order to facilitate their search within an uncharacterized biological sequence. As the biological data is comprised of a mosaic of sequence-levels motifs, it is significant to unravel the synergies of macromolecular coordination utilized in cell-specific differential synthesis of proteins. This article provides an overview of the various pattern representation methodologies and the surveys the pattern databases available for use to the molecular biologists. Our aim is to describe the principles behind the computational modeling and analysis techniques utilized in bioinformatics research, with the objective of providing insight necessary to better understand and effectively utilize the available databases and analysis tools. We also provide a detailed review of DNA sequence level patterns responsible for structural conformations within the Scaffold or Matrix Attachment Regions (S/MARs).

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.
  • Review

MeSH terms

  • Animals
  • Computational Biology / methods*
  • Databases, Genetic*
  • Gene Expression Profiling / methods
  • Gene Expression Regulation / physiology*
  • Genomics / methods*
  • Humans
  • Models, Genetic*
  • Sequence Analysis / methods*
  • Signal Transduction / physiology
  • Transcription Factors / genetics*
  • Transcription Factors / metabolism*

Substances

  • Transcription Factors