Phyloscan: locating transcription-regulating binding sites in mixed aligned and unaligned sequence data

Nucleic Acids Res. 2010 Jul;38(Web Server issue):W268-74. doi: 10.1093/nar/gkq330. Epub 2010 Apr 30.

Abstract

The transcription of a gene from its DNA template into an mRNA molecule is the first, and most heavily regulated, step in gene expression. Especially in bacteria, regulation is typically achieved via the binding of a transcription factor (protein) or small RNA molecule to the chromosomal region upstream of a regulated gene. The protein or RNA molecule recognizes a short, approximately conserved sequence within a gene's promoter region and, by binding to it, either enhances or represses expression of the nearby gene. Since the sought-for motif (pattern) is short and accommodating to variation, computational approaches that scan for binding sites have trouble distinguishing functional sites from look-alikes. Many computational approaches are unable to find the majority of experimentally verified binding sites without also finding many false positives. Phyloscan overcomes this difficulty by exploiting two key features of functional binding sites: (i) these sites are typically more conserved evolutionarily than are non-functional DNA sequences; and (ii) these sites often occur two or more times in the promoter region of a regulated gene. The website is free and open to all users, and there is no login requirement. Address: (http://bayesweb.wadsworth.org/phyloscan/).

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Binding Sites
  • Gene Expression Regulation*
  • Internet
  • Promoter Regions, Genetic*
  • Sequence Alignment
  • Sequence Analysis, DNA
  • Software*
  • Transcription Factors / metabolism
  • Transcription, Genetic*
  • User-Computer Interface

Substances

  • Transcription Factors