A systematic identification of species-specific protein succinylation sites using joint element features information

Int J Nanomedicine. 2017 Aug 28:12:6303-6315. doi: 10.2147/IJN.S140875. eCollection 2017.

Abstract

Lysine succinylation, an important type of protein posttranslational modification, plays significant roles in many cellular processes. Accurate identification of succinylation sites can facilitate our understanding about the molecular mechanism and potential roles of lysine succinylation. However, even in well-studied systems, a majority of the succinylation sites remain undetected because the traditional experimental approaches to succinylation site identification are often costly, time-consuming, and laborious. In silico approach, on the other hand, is potentially an alternative strategy to predict succinylation substrates. In this paper, a novel computational predictor SuccinSite2.0 was developed for predicting generic and species-specific protein succinylation sites. This predictor takes the composition of profile-based amino acid and orthogonal binary features, which were used to train a random forest classifier. We demonstrated that the proposed SuccinSite2.0 predictor outperformed other currently existing implementations on a complementarily independent dataset. Furthermore, the important features that make visible contributions to species-specific and cross-species-specific prediction of protein succinylation site were analyzed. The proposed predictor is anticipated to be a useful computational resource for lysine succinylation site prediction. The integrated species-specific online tool of SuccinSite2.0 is publicly accessible.

Keywords: feature selection; machine learning; posttranslation modification; sequence encoding; succinylation site prediction.

MeSH terms

  • Computational Biology / methods*
  • Computer Simulation
  • Databases, Protein
  • Lysine / chemistry
  • Lysine / metabolism*
  • Protein Processing, Post-Translational
  • Proteins / chemistry
  • Proteins / metabolism*
  • Software
  • Species Specificity
  • Succinic Acid / metabolism*

Substances

  • Proteins
  • Succinic Acid
  • Lysine