HemoNet: Predicting hemolytic activity of peptides with integrated feature learning

J Bioinform Comput Biol. 2021 Oct;19(5):2150021. doi: 10.1142/S0219720021500219. Epub 2021 Aug 5.

Abstract

Quantifying the hemolytic activity of peptides is a crucial step in the discovery of novel therapeutic peptides. Computational methods are attractive in this domain due to their ability to guide wet-lab experimental discovery or screening of peptides based on their hemolytic activity. However, existing methods are unable to accurately model various important aspects of this predictive problem such as the role of N/C-terminal modifications, D- and L- amino acids, etc. In this work, we have developed a novel neural network-based approach called HemoNet for predicting the hemolytic activity of peptides. The proposed method captures the contextual importance of different amino acids in a given peptide sequence using a specialized feature embedding in conjunction with SMILES-based fingerprint representation of N/C-terminal modifications. We have analyzed the predictive performance of the proposed method using stratified cross-validation in comparison with previous methods, non-redundant cross-validation as well as validation on external peptides and clinical antimicrobial peptides. Our analysis shows the proposed approach achieves significantly better predictive performance (AUC-ROC of 88%) in comparison to previous approaches (HemoPI and HemoPred with AUC-ROC of 73%). HemoNet can be a useful tool in the search for novel therapeutic peptides. The python implementation of the proposed method is available at the URL: https://github.com/adibayaseen/HemoNet.

Keywords: Hemolytic activity prediction; antimicrobial activity; machine learning guided drug discovery; peptide toxicity classification.

MeSH terms

  • Amino Acid Sequence
  • Antimicrobial Peptides*
  • Hemolysis
  • Humans
  • Machine Learning*
  • Peptides

Substances

  • Antimicrobial Peptides
  • Peptides