DiagnoProt: a tool for discovery of new molecules by mass spectrometry

Bioinformatics. 2017 Jun 15;33(12):1883-1885. doi: 10.1093/bioinformatics/btx093.

Abstract

Motivation: Around 75% of all mass spectra remain unidentified by widely adopted proteomic strategies. We present DiagnoProt, an integrated computational environment that can efficiently cluster millions of spectra and use machine learning to shortlist high-quality unidentified mass spectra that are discriminative of different biological conditions.

Results: We exemplify the use of DiagnoProt by shortlisting 4366 high-quality unidentified tandem mass spectra that are discriminative of different types of the Aspergillus fungus.

Availability and implementation: DiagnoProt, a demonstration video and a user tutorial are available at http://patternlabforproteomics.org/diagnoprot .

Contact: andrerfsilva@gmail.com or paulo@pcarvalho.com.

Supplementary information: Supplementary data are available at Bioinformatics online.

MeSH terms

  • Aspergillus / metabolism
  • Fungal Proteins / analysis
  • Machine Learning*
  • Proteomics / methods*
  • Sequence Analysis, Protein / methods*
  • Software*
  • Tandem Mass Spectrometry / methods*

Substances

  • Fungal Proteins