Bottom up proteomics data analysis strategies to explore protein modifications and genomic variants

Proteomics. 2015 Jun;15(11):1789-92. doi: 10.1002/pmic.201400186. Epub 2015 Mar 30.

Abstract

The quest to understand biological systems requires further attention of the scientific community to the challenges faced in proteomics. In fact the complexity of the proteome reaches uncountable orders of magnitude. This means that significant technical and data-analytic innovations will be needed for the full understanding of biology. Current state of art MS is probably our best choice for studying protein complexity and exploring new ways to use MS and MS derived data should be given higher priority. We present here a brief overview of visualization and statistical analysis strategies for quantitative peptide values on an individual protein basis. These analysis strategies can help pinpoint protein modifications, splice, and genomic variants of biological relevance. We demonstrate the application of these data analysis strategies using a bottom-up proteomics dataset obtained in a drug profiling experiment. Furthermore, we have also observed that the presented methods are useful for studying peptide distributions from clinical samples from a large number of individuals. We expect that the presented data analysis strategy will be useful in the future to define functional protein variants in biological model systems and disease studies. Therefore robust software implementing these strategies is urgently needed.

Keywords: Bioinformatics; Computational MS; Data visualization; Peptide quantitation; Proteoforms; Proteogenomics.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Calnexin / analysis
  • Calnexin / metabolism
  • Computational Biology / methods
  • Genetic Variation
  • Genomics
  • Glucosamine / pharmacology
  • Humans
  • Mass Spectrometry / methods*
  • Molecular Sequence Data
  • Protein Processing, Post-Translational*
  • Proteins / analysis*
  • Proteins / genetics
  • Proteins / metabolism*
  • Proteomics / methods*
  • Software

Substances

  • Proteins
  • Calnexin
  • Glucosamine