DeepSuccinylSite: a deep learning based approach for protein succinylation site prediction

BMC Bioinformatics. 2020 Apr 23;21(Suppl 3):63. doi: 10.1186/s12859-020-3342-z.

Abstract

Background: Protein succinylation has recently emerged as an important and common post-translation modification (PTM) that occurs on lysine residues. Succinylation is notable both in its size (e.g., at 100 Da, it is one of the larger chemical PTMs) and in its ability to modify the net charge of the modified lysine residue from + 1 to - 1 at physiological pH. The gross local changes that occur in proteins upon succinylation have been shown to correspond with changes in gene activity and to be perturbed by defects in the citric acid cycle. These observations, together with the fact that succinate is generated as a metabolic intermediate during cellular respiration, have led to suggestions that protein succinylation may play a role in the interaction between cellular metabolism and important cellular functions. For instance, succinylation likely represents an important aspect of genomic regulation and repair and may have important consequences in the etiology of a number of disease states. In this study, we developed DeepSuccinylSite, a novel prediction tool that uses deep learning methodology along with embedding to identify succinylation sites in proteins based on their primary structure.

Results: Using an independent test set of experimentally identified succinylation sites, our method achieved efficiency scores of 79%, 68.7% and 0.48 for sensitivity, specificity and MCC respectively, with an area under the receiver operator characteristic (ROC) curve of 0.8. In side-by-side comparisons with previously described succinylation predictors, DeepSuccinylSite represents a significant improvement in overall accuracy for prediction of succinylation sites.

Conclusion: Together, these results suggest that our method represents a robust and complementary technique for advanced exploration of protein succinylation.

Keywords: Convolutional neural network; Deep learning; Embedding; Long short-term memory; Recurrent neural network; Succinylation.

MeSH terms

  • Binding Sites
  • Citric Acid Cycle
  • Deep Learning*
  • Lysine / metabolism
  • Protein Processing, Post-Translational*
  • Proteins / chemistry
  • Proteins / metabolism*
  • Succinates / metabolism*

Substances

  • Proteins
  • Succinates
  • Lysine