RTCGAToolbox: a new tool for exporting TCGA Firehose data

PLoS One. 2014 Sep 2;9(9):e106397. doi: 10.1371/journal.pone.0106397. eCollection 2014.

Abstract

Background & objective: Managing data from large-scale projects such as The Cancer Genome Atlas (TCGA) for further analysis is an important and time-consuming step for research projects. Several efforts, such as the Firehose project, make TCGA pre-processed data publicly available via web services and data portals, but this information must be managed, downloaded and prepared for subsequent steps. We have developed an open-source and extensible R-based data client for pre-processed data from the Firehose, and demonstrate its use with sample case studies. Results show that our RTCGAToolbox can facilitate data management for researchers interested in working with TCGA data. The RTCGAToolbox can also be integrated with other analysis pipelines for further data processing.

Availability and implementation: The RTCGAToolbox is open-source and licensed under the GNU General Public License Version 2.0. All documentation and source code for RTCGAToolbox are freely available at http://mksamur.github.io/RTCGAToolbox/ for Linux and Mac OS X operating systems.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Breast Neoplasms / genetics
  • Computational Biology / methods*
  • Databases, Genetic*
  • Female
  • Genome, Human / genetics*
  • Humans
  • Neoplasms / genetics*
  • Phosphatidylinositol 3-Kinases / genetics
  • Software*

Substances

  • Phosphatidylinositol 3-Kinases