DeepAlgPro: an interpretable deep neural network model for predicting allergenic proteins

Chun He; Xinhai Ye; Yi Yang; Liya Hu; Yuxuan Si; Xianxin Zhao; Longfei Chen; Qi Fang; Ying Wei; Fei Wu; Gongyin Ye

doi:10.1093/bib/bbad246

Brief Bioinform. 2023 Jul 20;24(4):bbad246. doi: 10.1093/bib/bbad246.

Authors

Chun He¹, Xinhai Ye²,³, Yi Yang¹, Liya Hu², Yuxuan Si², Xianxin Zhao¹, Longfei Chen¹, Qi Fang¹, Ying Wei⁴, Fei Wu²,³, Gongyin Ye¹

Affiliations

¹ State Key Laboratory of Rice Biology and Breeding & Ministry of Agricultural and Rural Affairs Key Laboratory of Molecular Biology of Crop Pathogens and Insects, Institute of Insect Sciences, Zhejiang University, Hangzhou, China.
² College of Computer Science and Technology, Zhejiang University, Hangzhou, China.
³ Shanghai Institute for Advanced Study, Zhejiang University, Shanghai, China.
⁴ Department of Computer Science, City University of Hong Kong, Hong Kong, China.

PMID: 37385595

Abstract

Allergies have become an emerging public health problem worldwide. The most effective way to prevent allergies is to identify the causative allergen at its source and avoid re-exposure. However, most current computational methods for identifying allergens are based on homology or conventional machine learning; they are inefficient and leave room for improvement in detecting allergens with low homology. In addition, few deep learning-based methods have been reported, although deep learning has been applied successfully to several tasks in protein sequence analysis. In the present work, we propose a deep neural network-based model, called DeepAlgPro, to identify allergens. By comparing it with other available tools, we show its high accuracy and its applicability to large-scale prediction. Ablation experiments demonstrate the critical importance of the convolutional module in our model. Further analyses show that epitope features contribute to the model's decisions, which improves its interpretability. Finally, we find that DeepAlgPro is capable of detecting potential new allergens. Overall, DeepAlgPro can serve as powerful software for identifying allergens.

Keywords: allergen; attention mechanism; convolution; deep learning; epitope.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Allergens
Deep Learning*
Humans
Hypersensitivity*
Neural Networks, Computer
Proteins / metabolism

Substances

Allergens
Proteins