An overview of technologies for MS-based proteomics-centric multi-omics

Expert Rev Proteomics. 2022 Mar;19(3):165-181. doi: 10.1080/14789450.2022.2070476. Epub 2022 May 2.

Abstract

Introduction: Mass spectrometry-based proteomics reveals dynamic molecular signatures underlying phenotypes reflecting normal and perturbed conditions in living systems. Although valuable on its own, the proteome represents only one level of molecular information, with the genome, epigenome, transcriptome, and metabolome all providing complementary information. Multi-omic analysis integrating information from one or more of these other domains with proteomic information provides a more complete picture of molecular contributors to dynamic biological systems.

Areas covered: Here, we discuss improvements to mass spectrometry-based technologies, with a focus on peptide-based, bottom-up approaches, that have enabled deep, quantitative characterization of complex proteomes. These advances are facilitating the integration of proteomics data with other 'omic information, providing a more complete picture of living systems. We also describe the current state of bioinformatics software and approaches for integrating proteomics and other 'omics data, critical for enabling new discoveries driven by multi-omics.

Expert commentary: Multi-omics, centered on the integration of proteomics information with other 'omic information, has tremendous promise for biological and biomedical studies. Continued advances in approaches for generating deep, reliable proteomic data and bioinformatics tools aimed at integrating data across 'omic domains will ensure the discoveries offered by these multi-omic studies continue to increase.

Keywords: Mass spectrometry; bioinformatics; bottom-up proteomics; multi-omics; proteogenomics.

Plain language summary

Proteomics uses mass spectrometry to identify as many of the proteins in a system of interest as possible, making it extremely useful in both biomedical and basic biological research. Unlike next-generation DNA/genome sequencing, proteomics directly measures the changes in gene translation in response to a disease state, injury, etc. However, when proteomics data are examined together with other forms of 'omics data, such as transcriptomics, genomics, and metabolomics, a fuller biological picture emerges that can reveal the underlying regulatory networks of living systems and how they respond to positive and negative stimuli. This integration is called multi-omics and represents a powerful paradigm shift in systems biology. To be fully compatible with other 'omics datasets, proteomics data must be as complete and accurate as possible; in addition, the task of integrating multiple different kinds of datasets can be daunting to novice researchers. With this in mind, in this manuscript we review the technologies that allow for the generation of the best possible proteomics data for multi-omics analysis, as well as the software tools needed to integrate proteomics data with other 'omics data. We believe this review will enable other researchers to begin applying multi-omics approaches to answer their research questions.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Computational Biology
  • Mass Spectrometry
  • Proteome*
  • Proteomics*
  • Software

Substances

  • Proteome