Exploring the potential of public proteomics data

Proteomics. 2016 Jan;16(2):214-25. doi: 10.1002/pmic.201500295. Epub 2015 Dec 15.

Abstract

In a global effort for scientific transparency, it has become feasible and good practice to share experimental data supporting novel findings. Consequently, the amount of publicly available MS-based proteomics data has grown substantially in recent years. With some notable exceptions, this extensive material has however largely been left untouched. The time has now come for the proteomics community to utilize this potential gold mine for new discoveries, and uncover its untapped potential. In this review, we provide a brief history of the sharing of proteomics data, showing ways in which publicly available proteomics data are already being (re-)used, and outline potential future opportunities based on four different usage types: use, reuse, reprocess, and repurpose. We thus aim to assist the proteomics community in stepping up to the challenge, and to make the most of the rapidly increasing amount of public proteomics data.

Keywords: Bioinformatics; Computational proteomics; Data analysis; Data standards; Databases.

Publication types

  • Research Support, Non-U.S. Gov't
  • Review

MeSH terms

  • Animals
  • Computational Biology
  • Databases, Protein
  • Humans
  • Information Dissemination
  • Knowledge Bases
  • Molecular Sequence Annotation
  • Protein Processing, Post-Translational
  • Proteomics*