MITNet: a fusion transformer and convolutional neural network architecture approach for T-cell epitope prediction

Brief Bioinform. 2023 Jul 20;24(4):bbad202. doi: 10.1093/bib/bbad202.

Abstract

Classifying epitopes is essential since they can be applied in various fields, including therapeutics, diagnostics and peptide-based vaccines. To determine the epitope or peptide against an antibody, epitope mapping with peptides is the most extensively used method. However, this method is more time-consuming and inefficient than using present methods. The ability to retrieve data on protein sequences through laboratory procedures has led to the development of computational models that predict epitope binding based on machine learning and deep learning (DL). It has also evolved to become a crucial part of developing effective cancer immunotherapies. This paper proposes an architecture to generalize this case since various research strives to solve a low-performance classification problem. A proposed DL model is the fusion architecture, which combines two architectures: Transformer architecture and convolutional neural network (CNN), called MITNet and MITNet-Fusion. Combining these two architectures enriches feature space to correlate epitope labels with the binary classification method. The selected epitope-T-cell receptor (TCR) interactions are GILG, GLCT and NLVP, acquired from three databases: IEDB, VDJdb and McPAS-TCR. The previous input data was extracted using amino acid composition, dipeptide composition, spectrum descriptor and the combination of all those features called AADIP composition to encode the input data to DL architecture. For ensuring consistency, fivefold cross-validations were performed using the area under curve metric. Results showed that GILG, GLCT and NLVP received scores of 0.85, 0.87 and 0.86, respectively. Those results were compared to prior architecture and outperformed other similar deep learning models.

Keywords: CNN; deep learning; epitope classification; fusion architecture; transformer.

MeSH terms

  • Amino Acid Sequence
  • Epitopes, T-Lymphocyte*
  • Neural Networks, Computer*
  • Peptides / chemistry
  • Receptors, Antigen, T-Cell

Substances

  • Epitopes, T-Lymphocyte
  • Peptides
  • Receptors, Antigen, T-Cell