An Accessible Proteogenomics Informatics Resource for Cancer Researchers

Matthew C Chambers; Pratik D Jagtap; James E Johnson; Thomas McGowan; Praveen Kumar; Getiria Onsongo; Candace R Guerrero; Harald Barsnes; Marc Vaudel; Lennart Martens; Björn Grüning; Ira R Cooke; Mohammad Heydarian; Karen L Reddy; Timothy J Griffin

doi:10.1158/0008-5472.CAN-17-0331

Cancer Res. 2017 Nov 1;77(21):e43-e46. doi: 10.1158/0008-5472.CAN-17-0331.

Authors

Matthew C Chambers¹, Pratik D Jagtap², James E Johnson³, Thomas McGowan³, Praveen Kumar²,⁴, Getiria Onsongo³, Candace R Guerrero², Harald Barsnes⁵,⁶, Marc Vaudel⁷,⁸, Lennart Martens⁹,¹⁰,¹¹, Björn Grüning¹²,¹³, Ira R Cooke¹⁴, Mohammad Heydarian¹⁵, Karen L Reddy¹⁶, Timothy J Griffin¹⁷

Affiliations

¹ Department of Biochemistry, Vanderbilt University, Nashville, Tennessee.
² Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, Minnesota.
³ Minnesota Supercomputing Institute, University of Minnesota, Minneapolis, Minnesota.
⁴ Bioinformatics and Computational Biology Program, University of Minnesota-Rochester, Rochester, Minnesota.
⁵ Proteomics Unit, Department of Biomedicine, University of Bergen, Bergen, Norway.
⁶ Computational Biology Unit, Department of Informatics, University of Bergen, Bergen, Norway.
⁷ KG Jebsen Center for Diabetes Research, Department of Clinical Science, University of Bergen, Bergen, Norway.
⁸ Center for Medical Genetics and Molecular Medicine, Haukeland University Hospital, Bergen, Norway.
⁹ VIB-UGent Center for Medical Biotechnology, VIB, Ghent, Belgium.
¹⁰ Department of Biochemistry, Ghent University, Ghent, Belgium.
¹¹ Bioinformatics Institute Ghent, Ghent University, Ghent, Belgium.
¹² Department of Computer Science, Albert-Ludwigs-University, Freiburg, Freiburg, Germany.
¹³ Center for Biological Systems Analysis (ZBSA), University of Freiburg, Freiburg, Germany.
¹⁴ Comparative Genomics Centre and Department of Molecular and Cell Biology, James Cook University, Queensland, Australia.
¹⁵ Department of Biology, Johns Hopkins University, Baltimore, Maryland.
¹⁶ Department of Biological Chemistry, Center for Epigenetics and Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins University, Baltimore, Maryland.
¹⁷ Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, Minnesota. tgriffin@umn.edu.

Abstract

Proteogenomics has emerged as a valuable approach in cancer research. It integrates genomic and transcriptomic data with mass spectrometry-based proteomics data to directly identify expressed, variant protein sequences that may have functional roles in cancer. The approach is computationally intensive and requires integrating disparate software tools into sophisticated workflows, which hinders its adoption by nonexpert bench scientists. To address this barrier, we have developed an extensible, Galaxy-based resource aimed at giving more researchers access to, and training in, proteogenomic informatics. Our resource brings together software from several leading research groups to address two foundational aspects of proteogenomics: (i) generation of customized, annotated protein sequence databases from RNA-Seq data; and (ii) accurate matching of tandem mass spectrometry data to putative variants, followed by filtering to confirm their novelty. Directions for accessing software tools and workflows, along with instructional documentation, can be found at z.umn.edu/canresgithub. Cancer Res; 77(21); e43-46. ©2017 AACR.
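
The novelty-filtering step mentioned in (ii) can be illustrated with a minimal sketch. This is not the resource's actual implementation, which runs as Galaxy tools and workflows. The function names (`normalize`, `filter_novel`) and the toy sequences are hypothetical. The sketch assumes that a candidate peptide counts as novel only if it occurs nowhere in a reference proteome, with isoleucine and leucine treated as equivalent because they have the same mass.

```python
def normalize(seq):
    """Upper-case a sequence and map I to L, since the two residues are isobaric."""
    return seq.upper().replace("I", "L")


def filter_novel(candidates, reference_proteins):
    """Return the candidate peptides not found in any reference protein.

    Hypothetical helper for illustration only: a plain substring check
    stands in for the search a real workflow would run against a full
    reference proteome.
    """
    reference = [normalize(protein) for protein in reference_proteins]
    novel = []
    for peptide in candidates:
        key = normalize(peptide)
        if not any(key in protein for protein in reference):
            novel.append(peptide)
    return novel


# Toy data (illustrative only, not taken from the paper).
reference = ["MTEYKLVVVGAGGVGKSALTIQLIQNHFVDEYDPTIEDSYRK"]
candidates = [
    "LVVVGAGGVGK",                # matches the reference exactly
    "LVVVGADGVGK",                # single amino acid substitution
    "SALTLQLLQNHFVDEYDPTLEDSYR",  # differs from the reference only by I/L
]

print(filter_novel(candidates, reference))
```

In this toy run only the peptide with the substitution is retained. The I/L-only peptide is discarded because mass spectrometry alone cannot separate it from the reference sequence.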

Publication types

Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't

MeSH terms

Computational Biology / methods*
Genome, Human
Genomics / methods*
Humans
Neoplasms / genetics*
Proteomics / methods
Software*
Tandem Mass Spectrometry
Transcriptome / genetics

Grants and funding

U24 CA199347/CA/NCI NIH HHS/United States