Using Trawler_standalone to discover overrepresented motifs in DNA and RNA sequences derived from various experiments including chromatin immunoprecipitation

Yannick Haudry; Mirana Ramialison; Benedict Paten; Joachim Wittbrodt; Laurence Ettwiller

doi:10.1038/nprot.2009.158

Nat Protoc. 2010 Feb;5(2):323-34. doi: 10.1038/nprot.2009.158. Epub 2010 Feb 4.

Authors

Yannick Haudry¹, Mirana Ramialison, Benedict Paten, Joachim Wittbrodt, Laurence Ettwiller

Affiliation

¹ Developmental Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany.

PMID: 20134431
DOI: 10.1038/nprot.2009.158

Abstract

Genome-wide location analysis has become a standard technology to unravel gene regulation networks. The accurate characterization of nucleotide signatures in sequences is key to uncovering the regulatory logic but remains a computational challenge. This protocol describes how best to characterize these signatures (motifs) using the new standalone version of Trawler, which was designed and optimized to analyze chromatin immunoprecipitation (ChIP) data sets. In particular, we describe the three main steps of Trawler_standalone (motif discovery, clustering and visualization) and discuss the appropriate parameters to be used in each step depending on the data set and the biological questions addressed. Compared with five other motif discovery programs, Trawler_standalone is in most cases the fastest algorithm to accurately predict the correct motifs, especially for large data sets. Its running time ranges from a few seconds to several minutes, depending on the size of the data set and the parameters used. This protocol is best suited for bioinformaticians seeking to use Trawler_standalone in a high-throughput manner.
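
To make the notion of an "overrepresented motif" concrete, the following is a minimal Python sketch of one generic way to score overrepresentation: count k-mers in a sample set (for example, ChIP-enriched regions) and compare them with a background set using a binomial z-score. This is not Trawler_standalone's algorithm or interface. The function names, the pseudocount, the default k, and the z-score cutoff are all assumptions chosen for illustration.

```python
from collections import Counter
from math import sqrt


def count_kmers(sequences, k):
    """Count every overlapping k-mer across a list of sequences."""
    counts = Counter()
    for seq in sequences:
        seq = seq.upper()
        for i in range(len(seq) - k + 1):
            counts[seq[i:i + k]] += 1
    return counts


def overrepresented_kmers(sample, background, k=4, min_z=2.0):
    """Return (kmer, observed_count, z_score) tuples, highest z-score first.

    Hypothetical helper for illustration only: the background frequency of
    each k-mer gives its expected count in the sample, and a binomial
    z-score measures how far the observed count exceeds that expectation.
    """
    sample_counts = count_kmers(sample, k)
    background_counts = count_kmers(background, k)
    n_sample = sum(sample_counts.values())
    n_background = sum(background_counts.values())
    if n_sample == 0 or n_background == 0:
        return []

    n_possible = 4 ** k  # assumes a four-letter nucleotide alphabet
    hits = []
    for kmer, observed in sample_counts.items():
        # Add-one pseudocount so k-mers absent from the background
        # still get a nonzero expected frequency (an assumption).
        p = (background_counts[kmer] + 1) / (n_background + n_possible)
        expected = n_sample * p
        z = (observed - expected) / sqrt(n_sample * p * (1 - p))
        if z >= min_z:
            hits.append((kmer, observed, round(z, 2)))
    return sorted(hits, key=lambda hit: hit[2], reverse=True)


if __name__ == "__main__":
    # Toy sequences invented for this example; they are not real data.
    sample = ["TTGACGTCAAA", "CCGACGTCAGG", "AAGACGTCATT"]
    background = ["ATATATATATAT", "CGCGCGCGCGCG", "AATTCCGGAATT"]
    for kmer, observed, z in overrepresented_kmers(sample, background):
        print(kmer, observed, z)
```

A real analysis would need a much larger background set and a treatment of degenerate positions and reverse complements, none of which this sketch attempts.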

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Amino Acid Motifs / genetics
Base Sequence
Chromatin / genetics*
Chromatin / isolation & purification
Cluster Analysis
DNA / chemistry
DNA / genetics*
Gene Expression Regulation
Internet
RNA / chemistry
RNA / genetics*
Repetitive Sequences, Nucleic Acid

Substances

Chromatin
RNA
DNA