Detecting distant homologies on protozoans metabolic pathways using scientific workflows

Int J Data Min Bioinform. 2010;4(3):256-80. doi: 10.1504/ijdmb.2010.033520.

Abstract

Bioinformatics experiments are typically composed of programs in pipelines manipulating an enormous quantity of data. An interesting approach for managing those experiments is through workflow management systems (WfMS). In this work we discuss WfMS features to support genome homology workflows and present some relevant issues for typical genomic experiments. Our evaluation used Kepler WfMS to manage a real genomic pipeline, named OrthoSearch, originally defined as a Perl script. We show a case study detecting distant homologies on trypanomatids metabolic pathways. Our results reinforce the benefits of WfMS over script languages and point out challenges to WfMS in distributed environments.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Genome, Protozoan*
  • Genomics / methods*
  • Metabolic Networks and Pathways / genetics
  • Science
  • Sequence Homology
  • Workflow*