Quantitative modeling of peptide binding to TAP using support vector machine

Proteins. 2010 Jan;78(1):63-72. doi: 10.1002/prot.22535.

Abstract

The transport of peptides to the endoplasmic reticulum by the transporter associated with antigen processing (TAP) is a necessary step towards determining CD8 T cell epitopes. In this work, we have studied the predictive performance of support vector machine models trained on single residue positions and residue combinations drawn from a large dataset consisting of 613 nonamer peptides of known affinity to TAP. Predictive performance of these TAP affinity models was evaluated under 10-fold cross-validation experiments and measured using Pearson's correlation coefficients (R(p)). Our results show that every peptide position (P1-P9) contributes to TAP binding (minimum R(p) of 0.26 +/- 0.11 was achieved by a model trained on the P6 residue), although the largest contributions to binding correspond to the C-terminal end (R(p) = 0.68 +/- 0.06) and the P1 (R(p) = 0.51 +/- 0.09) and P2 (0.57 +/- 0.08) residues of the peptide. Training the models on additional peptide residues generally improved their predictive performance and a maximum correlation (R(p) = 0.89 +/- 0.03) was achieved by a model trained on the full-length sequences or a residue selection consisting of the first 5 N- and last 3 C-terminal residues of the peptides included in the training set. A system for predicting the binding affinity of peptides to TAP using the methods described here is readily available for free public use at http://imed.med.ucm.es/Tools/tapreg/.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • ATP-Binding Cassette Transporters / metabolism*
  • Amino Acid Sequence
  • Artificial Intelligence
  • Databases, Protein
  • Models, Biological
  • Peptides / chemistry
  • Peptides / metabolism*
  • Protein Binding
  • Protein Transport

Substances

  • ATP-Binding Cassette Transporters
  • Peptides
  • transporter associated with antigen processing (TAP)