cACP-DeepGram: Classification of anticancer peptides via deep neural network and skip-gram-based word embedding model

Artif Intell Med. 2022 Sep:131:102349. doi: 10.1016/j.artmed.2022.102349. Epub 2022 Jul 6.

Abstract

Cancer is a Toxic health concern worldwide, it happens when cellular modifications cause the irregular growth and division of human cells. Several traditional approaches such as therapies and wet laboratory-based methods have been applied to treat cancer cells. However, these methods are considered less effective due to their high cost and diverse side effects. According to recent advancements, peptide-based therapies have attracted the attention of scientists because of their high selectivity. Peptide therapy can efficiently treat the targeted cells, without affecting the normal cells. Due to the rapid increase of peptide sequences, an accurate prediction model has become a challenging task. Keeping the significance of anticancer peptides (ACPs) in cancer treatment, an intelligent and reliable prediction model is highly indispensable. In this paper, a FastText-based word embedding strategy has been employed to represent each peptide sample via a skip-gram model. After extracting the peptide embedding descriptors, the deep neural network (DNN) model was applied to accurately discriminate the ACPs. The optimized parameters of DNN achieved an accuracy of 96.94 %, 93.41 %, and 94.02 % using training, alternate, and independent samples, respectively. It was observed that our proposed cACP-DeepGram model outperformed and reported ~10 % highest prediction accuracy than existing predictors. It is suggested that the cACP-DeepGram model will be a reliable tool for scientists and might play a valuable role in academic research and drug discovery. The source code and the datasets are publicly available at https://github.com/shahidakbarcs/cACP-DeepGram.

Keywords: Anticancer peptides; Classification; Deep neural network; FastText; Word embedding.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Humans
  • Neural Networks, Computer*
  • Peptides*
  • Software

Substances

  • Peptides