GenomicTools: a computational platform for developing high-throughput analytics in genomics

Bioinformatics. 2012 Jan 15;28(2):282-3. doi: 10.1093/bioinformatics/btr646. Epub 2011 Nov 22.

Abstract

Motivation: Recent advances in sequencing technology have dramatically increased the volume of sequencing data, which in turn requires efficient management of computational resources, such as computing time and memory, as well as efficient prototyping of computational pipelines.

Results: We present GenomicTools, a flexible computational platform comprising both a set of command-line tools and a C++ API, for the analysis and manipulation of high-throughput sequencing data such as DNA-seq, RNA-seq, ChIP-seq and MethylC-seq. GenomicTools implements a variety of mathematical operations between sets of genomic regions, thereby enabling the prototyping of computational pipelines that address a wide spectrum of tasks, ranging from pre-processing and quality control to meta-analyses. Additionally, the GenomicTools platform is designed to analyze datasets of any size by minimizing memory requirements. In practical applications, where comparable, GenomicTools outperforms existing tools in both time and memory usage.
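
As an illustration of what an operation between two sets of genomic regions can look like, the following C++ sketch computes pairwise overlaps with a two-cursor merge. It is not the GenomicTools API: the `Region` type and the `intersect_sorted` function are hypothetical names introduced here, and the sketch assumes half-open intervals, inputs sorted by chromosome name and start position, and no overlaps within either set. Because each cursor only moves forward, a streaming variant of the same idea would need to hold just one region per input in memory at a time, which is one plausible way to keep memory use independent of dataset size.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical region type (not from GenomicTools):
// a half-open interval [start, end) on a named chromosome.
struct Region {
    std::string chrom;
    long start;
    long end;
};

// Returns the overlapping segments between two region sets.
// Assumptions: both inputs are sorted by (chrom, start) using the same
// chromosome ordering, and regions within each set do not overlap.
std::vector<Region> intersect_sorted(const std::vector<Region>& a,
                                     const std::vector<Region>& b) {
    std::vector<Region> out;
    std::size_t i = 0, j = 0;
    while (i < a.size() && j < b.size()) {
        const Region& x = a[i];
        const Region& y = b[j];
        assert(x.start <= x.end && y.start <= y.end);

        if (x.chrom != y.chrom) {
            // Advance whichever set is on the earlier chromosome.
            if (x.chrom < y.chrom) ++i; else ++j;
            continue;
        }

        // The overlap of two intervals is [max(starts), min(ends)).
        const long lo = std::max(x.start, y.start);
        const long hi = std::min(x.end, y.end);
        if (lo < hi) out.push_back({x.chrom, lo, hi});

        // The region that ends first cannot overlap anything further.
        if (x.end < y.end) ++i; else ++j;
    }
    return out;
}
```

Under these assumptions, intersecting {chr1:10-20, chr1:30-40} with {chr1:15-35} would yield chr1:15-20 and chr1:30-35. The work done is linear in the total number of regions, since each step advances one of the two cursors.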

Availability: The GenomicTools platform (version 2.0.0) was implemented in C++. The source code, documentation, user manual, example datasets and scripts are available online at http://code.google.com/p/ibm-cbc-genomic-tools.

MeSH terms

  • Computational Biology / methods
  • Genome, Human
  • Genomics / methods*
  • High-Throughput Nucleotide Sequencing
  • Humans
  • Software*