Selection and combination of machine learning classifiers for prediction of linear B-cell epitopes on proteins

J Mol Recognit. 2006 May-Jun;19(3):209-14. doi: 10.1002/jmr.770.

Abstract

Recently, new machine learning classifiers for the prediction of linear B-cell epitopes were presented. Here we show the application of Receiver Operator Characteristics (ROC) convex hulls to select optimal classifiers as well as possibilities to improve the post test probability (PTP) to meet real world requirements such as high throughput epitope screening of whole proteomes. The major finding is that ROC convex hulls present an easy to use way to rank classifiers based on their prediction conservativity as well as to select candidates for ensemble classifiers when validating against the antigenicity profile of 10 HIV-1 proteins. We also show that linear models are at least equally efficient to model the available data when compared to multi-layer feed-forward neural networks.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Artificial Intelligence*
  • Epitopes, B-Lymphocyte / chemistry
  • Epitopes, B-Lymphocyte / classification
  • Epitopes, B-Lymphocyte / immunology*
  • HIV / metabolism
  • Linear Models
  • Neural Networks, Computer
  • Pattern Recognition, Automated / methods
  • Proteins / chemistry
  • Proteins / classification
  • Proteins / immunology*
  • ROC Curve
  • Reproducibility of Results
  • Viral Proteins / chemistry
  • Viral Proteins / classification
  • Viral Proteins / immunology

Substances

  • Epitopes, B-Lymphocyte
  • Proteins
  • Viral Proteins