Protein spot detection and quantification in 2-DE gel images using machine-learning methods

Proteomics. 2011 May;11(10):2038-50. doi: 10.1002/pmic.201000601. Epub 2011 Apr 18.

Abstract

Two-dimensional gel electrophoresis (2-DE) is the most established protein separation method used in expression proteomics. Despite the existence of sophisticated software tools, 2-DE gel image analysis still remains a serious bottleneck. The low accuracies of commercial software packages and the extensive manual calibration that they often require for acceptable results show that we are far from achieving the goal of a fully automated and reliable, high-throughput gel processing system. We present a novel spot detection and quantification methodology which draws heavily from unsupervised machine-learning methods. Using the proposed hierarchical machine learning-based segmentation methodology reduces both the number of faint spots missed (improves sensitivity) and the number of extraneous spots introduced (improves precision). The detection and quantification performance has been thoroughly evaluated and is shown to compare favorably (higher F-measure) to a commercially available software package (PDQuest). The whole image analysis pipeline that we have developed is fully automated and can be used for high-throughput proteomics analysis since it does not require any manual intervention for recalibration every time a new 2-DE gel image is to be analyzed. Furthermore, it can be easily parallelized for high performance and also applied without any modification to prealigned group average gels.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Artificial Intelligence*
  • Calibration
  • Electrophoresis, Gel, Two-Dimensional / methods*
  • Image Processing, Computer-Assisted / methods*
  • Proteins / analysis*
  • Sensitivity and Specificity

Substances

  • Proteins