HeapMS: An Automatic Peak-Picking Pipeline for Targeted Proteomic Data Powered by 2D Heatmap Transformation and Convolutional Neural Networks

Anal Chem. 2023 Oct 24;95(42):15486-15496. doi: 10.1021/acs.analchem.3c01011. Epub 2023 Oct 11.

Abstract

The process of peak picking and quality assessment for multiple reaction monitoring (MRM) data demands significant human effort, especially for signals with low abundance and high interference. Although multiple peak-picking software packages are available, they often fail to detect peaks with low quality and do not report cases with low confidence. Furthermore, visual examination of all chromatograms is still necessary to identify uncertain or erroneous cases. This study introduces HeapMS, a web service that uses artificial intelligence to assist with peak picking and the quality assessment of MRM chromatograms. HeapMS applies a rule-based filter to remove chromatograms with low interference and high-confidence peak boundaries detected by Skyline. Additionally, it transforms two histograms (representing light and heavy peptides) into a single encoded heatmap and performs a two-step evaluation (quality detection and peak picking) using image convolutional neural networks. HeapMS offers three categories of peak picking: uncertain peak picking that requires manual inspection, deletion peak picking that requires removal or manual re-examination, and automatic peak picking. HeapMS acquires the chromatogram and peak-picking boundaries directly from Skyline output. The output results are imported back into Skyline for further manual inspection, facilitating integration with Skyline. HeapMS offers the benefit of detecting chromatograms that should be deleted or require human inspection. Based on defined categories, it can significantly reduce human workload and provide consistent results. Furthermore, by using heatmaps instead of histograms, HeapMS can adapt to future updates in image recognition models. The HeapMS is available at: https://github.com/ccllabe/HeapMS.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Artificial Intelligence*
  • Humans
  • Neural Networks, Computer
  • Proteomics
  • Software