Attentive Variational Information Bottleneck for TCR-peptide interaction prediction

Bioinformatics. 2023 Jan 1;39(1):btac820. doi: 10.1093/bioinformatics/btac820.

Abstract

Motivation: We present a multi-sequence generalization of Variational Information Bottleneck and call the resulting model Attentive Variational Information Bottleneck (AVIB). Our AVIB model leverages multi-head self-attention to implicitly approximate a posterior distribution over latent encodings conditioned on multiple input sequences. We apply AVIB to a fundamental immuno-oncology problem: predicting the interactions between T-cell receptors (TCRs) and peptides.

Results: Experimental results on various datasets show that AVIB significantly outperforms state-of-the-art methods for TCR-peptide interaction prediction. Additionally, we show that the latent posterior distribution learned by AVIB is particularly effective for the unsupervised detection of out-of-distribution amino acid sequences.

Availability and implementation: The code and the data used for this study are publicly available at: https://github.com/nec-research/vibtcr.

Supplementary information: Supplementary data are available at Bioinformatics online.

MeSH terms

  • Amino Acid Sequence
  • Peptides*
  • Receptors, Antigen, T-Cell / genetics
  • Software*

Substances

  • Peptides
  • Receptors, Antigen, T-Cell