DeepAlgPro: an interpretable deep neural network model for predicting allergenic proteins

Brief Bioinform. 2023 Jul 20;24(4):bbad246. doi: 10.1093/bib/bbad246.

Abstract

Allergies have become an emerging public health problem worldwide. The most effective way to prevent allergies is to find the causative allergen at the source and avoid re-exposure. However, most of the current computational methods used to identify allergens were based on homology or conventional machine learning methods, which were inefficient and still had room to be improved for the detection of allergens with low homology. In addition, few methods based on deep learning were reported, although deep learning has been successfully applied to several tasks in protein sequence analysis. In the present work, a deep neural network-based model, called DeepAlgPro, was proposed to identify allergens. We showed its great accuracy and applicability to large-scale forecasts by comparing it to other available tools. Additionally, we used ablation experiments to demonstrate the critical importance of the convolutional module in our model. Moreover, further analyses showed that epitope features contributed to model decision-making, thus improving the model's interpretability. Finally, we found that DeepAlgPro was capable of detecting potential new allergens. Overall, DeepAlgPro can serve as powerful software for identifying allergens.

Keywords: allergen; attention mechanism; convolution; deep learning; epitope.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Allergens
  • Deep Learning*
  • Humans
  • Hypersensitivity*
  • Neural Networks, Computer
  • Proteins / metabolism

Substances

  • Allergens
  • Proteins