iTTCA-Hybrid: Improved and robust identification of tumor T cell antigens by utilizing hybrid feature representation

Anal Biochem. 2020 Jun 15:599:113747. doi: 10.1016/j.ab.2020.113747. Epub 2020 Apr 22.

Abstract

In spite of the repertoire of existing cancer therapies, the ongoing recurrence and new cases of cancer poses a challenging health concern that prompts for novel and effective treatment. Cancer immunotherapy represents a promising venue for treatment by harnessing the body's immune system to combat cancer. Therefore, the identification of tumor T cell antigen represents an exciting area to explore. Computational tools have been instrumental in the identification of tumor T cell antigens and it is highly desirable to attain highly accurate models in a timely fashion from large volumes of peptides generated in the post-genomic era. In this study, we present a reliable, accurate, unbiased and automated sequence-based predictor named iTTCA-Hybrid for identifying tumor T cell antigens. The iTTCA-Hybrid approach proposed herein employs two robust machine learning models (e.g. support vector machine and random forest) constructed using five feature encoding strategies (i.e. amino acid composition, dipeptide composition, pseudo amino acid composition, distribution of amino acid properties in sequences and physicochemical properties derived from the AAindex). Rigorous independent test indicated that the iTTCA-Hybrid approach achieved an accuracy and area under the curve of 73.60% and 0.783, respectively, which corresponds to 4% and 7% performance increase than those of existing methods thereby indicating the superiority of the proposed model. To the best of our knowledge, the iTTCA-Hybrid is the first free web server (Available at http://camt.pythonanywhere.com/iTTCA-Hybrid) for identifying tumor T cell antigens presented by the MHC class I. The proposed web server allows robust predictions to be made without the need to develop in-house prediction models.

Keywords: Cancer immunotherapy; Machine learning; Random forest; Support vector machine; T-cell antigen; T-cell epitope.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Antigens, Neoplasm / immunology*
  • Histocompatibility Antigens Class I / analysis*
  • Humans
  • Immunotherapy
  • Machine Learning*
  • Neoplasms / immunology*
  • Neoplasms / therapy
  • T-Lymphocytes / cytology
  • T-Lymphocytes / immunology*

Substances

  • Antigens, Neoplasm
  • Histocompatibility Antigens Class I