Multi-omics Visualization Platform: An extensible Galaxy plug-in for multi-omics data visualization and exploration

Gigascience. 2020 Apr 1;9(4):giaa025. doi: 10.1093/gigascience/giaa025.

Abstract

Background: Proteogenomics integrates genomics, transcriptomics, and mass spectrometry (MS)-based proteomics data to identify novel protein sequences arising from gene and transcript sequence variants. Proteogenomic data analysis requires integration of disparate 'omic software tools, as well as customized tools to view and interpret results. The flexible Galaxy platform has proven valuable for proteogenomic data analysis. Here, we describe a novel Multi-omics Visualization Platform (MVP) for organizing, visualizing, and exploring proteogenomic results, adding a critically needed tool for data exploration and interpretation.

Findings: MVP is built as an HTML Galaxy plug-in, primarily based on JavaScript. Via the Galaxy API, MVP takes SQLite databases as input: a custom data type (mzSQLite) containing MS-based peptide identification information, a variant annotation table, and a coding sequence table. Users can interactively filter identified peptides based on sequence and data quality metrics, view annotated peptide MS data, and visualize protein-level information, along with genomic coordinates. Peptides that pass the user-defined thresholds can be sent back to Galaxy via the API for further analysis; processed data and visualizations can also be saved and shared. MVP leverages the Integrative Genomics Viewer JavaScript framework, enabling interactive visualization of peptides and corresponding transcript and genomic coding information within the MVP interface.
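The filtering step described above, selecting identified peptides from an SQLite database by sequence and a quality threshold, can be illustrated with a minimal sketch. It is written in Python for brevity, not in MVP's own JavaScript, and it is not MVP's implementation. The table name `peptide_hits` and the columns `sequence` and `score` are hypothetical and do not reflect the actual mzSQLite schema; the demo rows are made up.

```python
import sqlite3


def filter_peptides(conn, min_score, sequence_substring=""):
    """Return (sequence, score) rows scoring at least min_score.

    If sequence_substring is given, only sequences containing it are kept.
    Table and column names are hypothetical, not the real mzSQLite schema.
    """
    query = (
        "SELECT sequence, score FROM peptide_hits "
        "WHERE score >= ? AND sequence LIKE ? "
        "ORDER BY score DESC"
    )
    # Parameters are bound by sqlite3, so user input is never spliced into SQL.
    pattern = f"%{sequence_substring}%"
    return conn.execute(query, (min_score, pattern)).fetchall()


def build_demo_db():
    """Create an in-memory database with a few made-up peptide rows."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE peptide_hits (sequence TEXT, score REAL)")
    conn.executemany(
        "INSERT INTO peptide_hits VALUES (?, ?)",
        [
            ("ELVISLIVESK", 0.99),
            ("PEPTIDER", 0.42),
            ("LIVERK", 0.87),
        ],
    )
    conn.commit()
    return conn


conn = build_demo_db()
passing = filter_peptides(conn, 0.8)          # quality threshold only
matching = filter_peptides(conn, 0.0, "PEP")  # sequence filter only
```

In this sketch the threshold and the sequence filter are combined in a single parameterized query, so the database does the filtering and only passing rows are returned to the caller. The substring is matched with SQL `LIKE`, which treats `%` and `_` as wildcards; that is harmless for amino acid sequences but would need escaping for arbitrary text.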

Conclusions: MVP provides an extensible platform for automated, interactive visualization of proteogenomic results within the Galaxy environment, adding a needed tool for exploring and interpreting those results. Its design provides a basis for developing new functionality for proteogenomic data visualization.

Keywords: Galaxy; Integrative Genomics Viewer; RNA-Seq; mass spectrometry; proteogenomics; proteomics; transcriptomics; visualization.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Amino Acid Sequence / genetics
  • Computational Biology / trends
  • Data Visualization*
  • Genome / genetics*
  • Genomics / trends
  • Humans
  • Mass Spectrometry
  • Open Reading Frames
  • Peptides / genetics
  • Proteome / genetics*
  • Proteomics*

Substances

  • Peptides
  • Proteome