Exploring the potential of public proteomics data

Marc Vaudel; Kenneth Verheggen; Attila Csordas; Helge Raeder; Frode S Berven; Lennart Martens; Juan A Vizcaíno; Harald Barsnes

doi:10.1002/pmic.201500295

Proteomics. 2016 Jan;16(2):214-25. doi: 10.1002/pmic.201500295. Epub 2015 Dec 15.

Authors

Marc Vaudel¹, Kenneth Verheggen²,³,⁴, Attila Csordas⁵, Helge Raeder⁶, Frode S Berven¹,⁷, Lennart Martens²,³,⁴, Juan A Vizcaíno⁵, Harald Barsnes¹,⁶

Affiliations

¹ Proteomics Unit, Department of Biomedicine, University of Bergen, Bergen, Norway.
² Medical Biotechnology Center, VIB, Ghent, Belgium.
³ Department of Biochemistry, Ghent University, Ghent, Belgium.
⁴ Bioinformatics Institute Ghent, Ghent University, Ghent, Belgium.
⁵ European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, UK.
⁶ Department of Clinical Science, KG Jebsen Center for Diabetes Research, University of Bergen, Bergen, Norway.
⁷ Department of Clinical Medicine, KG Jebsen Centre for Multiple Sclerosis Research, University of Bergen, Bergen, Norway.

Abstract

In a global effort for scientific transparency, it has become feasible and good practice to share experimental data supporting novel findings. Consequently, the amount of publicly available MS-based proteomics data has grown substantially in recent years. With some notable exceptions, this extensive material has, however, largely been left untouched. The time has now come for the proteomics community to utilize this potential gold mine for new discoveries, and uncover its untapped potential. In this review, we provide a brief history of the sharing of proteomics data, show ways in which publicly available proteomics data are already being (re-)used, and outline potential future opportunities based on four different usage types: use, reuse, reprocess, and repurpose. We thus aim to assist the proteomics community in stepping up to the challenge, and to make the most of the rapidly increasing amount of public proteomics data.

Keywords: Bioinformatics; Computational proteomics; Data analysis; Data standards; Databases.

Publication types

Research Support, Non-U.S. Gov't
Review

MeSH terms

Animals
Computational Biology
Databases, Protein
Humans
Information Dissemination
Knowledge Bases
Molecular Sequence Annotation
Protein Processing, Post-Translational
Proteomics*
