Galaxy tools and workflows for sequence analysis with applications in molecular plant pathology

PeerJ. 2013 Sep 17:1:e167. doi: 10.7717/peerj.167. eCollection 2013.

Abstract

The Galaxy Project offers the popular web browser-based platform Galaxy for running bioinformatics tools and constructing simple workflows. Here, we present a broad collection of additional Galaxy tools for large scale analysis of gene and protein sequences. The motivating research theme is the identification of specific genes of interest in a range of non-model organisms, and our central example is the identification and prediction of "effector" proteins produced by plant pathogens in order to manipulate their host plant. This functional annotation of a pathogen's predicted capacity for virulence is a key step in translating sequence data into potential applications in plant pathology. This collection includes novel tools, and widely-used third-party tools such as NCBI BLAST+ wrapped for use within Galaxy. Individual bioinformatics software tools are typically available separately as standalone packages, or in online browser-based form. The Galaxy framework enables the user to combine these and other tools to automate organism scale analyses as workflows, without demanding familiarity with command line tools and scripting. Workflows created using Galaxy can be saved and are reusable, so may be distributed within and between research groups, facilitating the construction of a set of standardised, reusable bioinformatic protocols. The Galaxy tools and workflows described in this manuscript are open source and freely available from the Galaxy Tool Shed (http://usegalaxy.org/toolshed or http://toolshed.g2.bx.psu.edu).

Keywords: Accessibility; Annotation; Effector proteins; Galaxy; Genomics; Pipeline; Reproducibility; Sequence analysis; Workflow.

Grants and funding

PJC and LP are supported by the Scottish Government Rural and Environmental Research and Analysis Directorate. The work from BAG has been funded by the Excellence Initiative of the German Federal and State Governments through the Junior Research Group Program (ZUK 43). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.