PeakML/mzMatch: a file format, Java library, R library, and tool-chain for mass spectrometry data analysis

Anal Chem. 2011 Apr 1;83(7):2786-93. doi: 10.1021/ac2000994. Epub 2011 Mar 14.

Abstract

The recent proliferation of high-resolution mass spectrometers has generated a wealth of new data analysis methods. However, flexible integration of these methods into configurations best suited to the research question is hampered by heterogeneous file formats and monolithic software development. The mzXML, mzData, and mzML file formats have enabled uniform access to unprocessed raw data. In this paper we present our efforts to produce an equally simple and powerful format, PeakML, to uniformly exchange processed intermediary and result data. To demonstrate the versatility of PeakML, we have developed an open source Java toolkit for processing, filtering, and annotating mass spectra in a customizable pipeline (mzMatch), as well as a user-friendly data visualization environment (PeakML Viewer). The PeakML format in particular enables the flexible exchange of processed data between software created by different groups or companies, as we illustrate by providing a PeakML-based integration of the widely used XCMS package with mzMatch data processing tools. As an added advantage, downstream analysis can benefit from direct access to the full mass trace information underlying summarized mass spectrometry results, providing the user with the means to rapidly verify results. The PeakML/mzMatch software is freely available at http://mzmatch.sourceforge.net, with documentation, tutorials, and a community forum.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Computer Graphics
  • Information Storage and Retrieval / methods*
  • Mass Spectrometry / methods*
  • Software*
  • Statistics as Topic / methods*