jmzReader: A Java parser library to process and visualize multiple text and XML-based mass spectrometry data formats

Proteomics. 2012 Mar;12(6):795-8. doi: 10.1002/pmic.201100578.

Abstract

We here present the jmzReader library: a collection of Java application programming interfaces (APIs) to parse the most commonly used peak list and XML-based mass spectrometry (MS) data formats: DTA, MS2, MGF, PKL, mzXML, mzData, and mzML (based on the already existing API jmzML). The library is optimized to be used in conjunction with mzIdentML, the recently released standard data format for reporting protein and peptide identifications, developed by the HUPO proteomics standards initiative (PSI). mzIdentML files do not contain spectra data but contain references to different kinds of external MS data files. As a key functionality, all parsers implement a common interface that supports the various methods used by mzIdentML to reference external spectra. Thus, when developing software for mzIdentML, programmers no longer have to support multiple MS data file formats but only this one interface. The library (which includes a viewer) is open source and, together with detailed documentation, can be downloaded from http://code.google.com/p/jmzreader/.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Databases, Protein
  • Electronic Data Processing / methods
  • Electronic Data Processing / standards
  • Mass Spectrometry / methods*
  • Mass Spectrometry / standards
  • Proteins / chemistry
  • Proteomics / methods*
  • Proteomics / standards
  • Software* / standards
  • User-Computer Interface

Substances

  • Proteins