A Primer for Access to Repositories of Cancer-Related Genomic Big Data

Methods Mol Biol. 2019:1878:1-37. doi: 10.1007/978-1-4939-8868-6_1.

Abstract

Large datasets have become ubiquitous in the biomedical sciences. In recent years, researchers in cancer genomics have generated large volumes of data from their experiments. Those who produce this data often analyze only a narrow subset of it, determined by the research question they are trying to address; this is the case whether they are working independently or as part of a large-scale cancer genomics project. This situation creates an opportunity for other researchers to repurpose the data to test different hypotheses, provided the data is made easily and freely available. New biological insights that arise when more researchers gain access to data they could not otherwise generate themselves are a boon for the field. This chapter reviews several cancer genomics-related databases and outlines the types of data they contain, as well as the methods required to access each database. While the list is not comprehensive, it should provide a basis for cancer researchers to begin exploring some of the many large datasets available to them.

Keywords: Cancer genomics; Cancer ontology; Cancer resources; Genomics databases.

Publication types

  • Review

MeSH terms

  • Databases, Genetic
  • Genomics / methods
  • Humans
  • Neoplasms / genetics*
  • Research