Probabilistic approaches to transcription factor binding site prediction

Stefan Posch; Jan Grau; André Gohr; Jens Keilwagen; Ivo Grosse

doi:10.1007/978-1-60761-854-6_7

Probabilistic approaches to transcription factor binding site prediction

Methods Mol Biol. 2010:674:97-119. doi: 10.1007/978-1-60761-854-6_7.

Authors

Stefan Posch¹, Jan Grau, André Gohr, Jens Keilwagen, Ivo Grosse

Affiliation

¹ Institute of Computer Science, Martin Luther University, Halle-Wittenberg, Germany.

PMID: 20827588
DOI: 10.1007/978-1-60761-854-6_7

Abstract

Many different computer programs for the prediction of transcription factor binding sites have been developed over the last decades. These programs differ from each other by pursuing different objectives and by taking into account different sources of information. For methods based on statistical approaches, these programs differ at an elementary level from each other by the statistical models used for individual binding sites and flanking sequences and by the learning principles employed for estimating the model parameters. According to our experience, both the models and the learning principles should be chosen with great care, depending on the specific task at hand, but many existing programs do not allow the user to choose them freely. Hence, we developed Jstacs, an object-oriented Java framework for sequence analysis, which allows the user to combine different statistical models and different learning principles in a modular manner with little effort. In this chapter we explain how Jstacs can be used for the recognition of transcription factor binding sites.

MeSH terms

Base Sequence
Binding Sites
Computational Biology / methods*
Humans
Likelihood Functions
Promoter Regions, Genetic / genetics
Receptors, Steroid / metabolism
Reproducibility of Results
Software
Transcription Factors / metabolism*

Substances

Receptors, Steroid
Transcription Factors