Computational prediction of conformational B-cell epitopes from antigen primary structures by ensemble learning

PLoS One. 2012;7(8):e43575. doi: 10.1371/journal.pone.0043575. Epub 2012 Aug 21.

Abstract

Motivation: The conformational B-cell epitopes are the specific sites on the antigens that have immune functions. The identification of conformational B-cell epitopes is of great importance to immunologists for facilitating the design of peptide-based vaccines. As an attempt to narrow the search for experimental validation, various computational models have been developed for the epitope prediction by using antigen structures. However, the application of these models is undermined by the limited number of available antigen structures. In contrast to the most of available structure-based methods, we here attempt to accurately predict conformational B-cell epitopes from antigen sequences.

Methods: In this paper, we explore various sequence-derived features, which have been observed to be associated with the location of epitopes or ever used in the similar tasks. These features are evaluated and ranked by their discriminative performance on the benchmark datasets. From the perspective of information science, the combination of various features can usually lead to better results than the individual features. In order to build the robust model, we adopt the ensemble learning approach to incorporate various features, and develop the ensemble model to predict conformational epitopes from antigen sequences.

Results: Evaluated by the leave-one-out cross validation, the proposed method gives out the mean AUC scores of 0.687 and 0.651 on two datasets respectively compiled from the bound structures and unbound structures. When compared with publicly available servers by using the independent dataset, our method yields better or comparable performance. The results demonstrate the proposed method is useful for the sequence-based conformational epitope prediction.

Availability: The web server and datasets are freely available at http://bcell.whu.edu.cn.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Artificial Intelligence*
  • Computational Biology / methods*
  • Epitopes, B-Lymphocyte / chemistry*
  • Epitopes, B-Lymphocyte / immunology
  • Protein Conformation

Substances

  • Epitopes, B-Lymphocyte

Grants and funding

This work is supported by the National Science Foundation of China (60970063, 61103126, http://www.nsfc.gov.cn/), the Ph.D. Programs Foundation of Ministry of Education of China (20090141110026, 20100141120049, http://www.moe.gov.cn/), Program for New Century Excellent Talents in University (NCET-10-0644, http://www.moe.gov.cn/), Natural Science Foundation of Hubei Province (No. 2011CDB454, http://www.hbstd.gov.cn/) and the Fundamental Research Funds for the Central Universities of China (6081007, 3101054, http://kfy.whu.edu.cn/). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.