Predicting lysine-malonylation sites of proteins using sequence and predicted structural features

J Comput Chem. 2018 Aug 15;39(22):1757-1763. doi: 10.1002/jcc.25353. Epub 2018 May 14.

Abstract

Malonylation is a recently discovered post-translational modification (PTM) in which a malonyl group attaches to a lysine (K) amino acid residue of a protein. In this work, a novel machine learning model, SPRINT-Mal, is developed to predict malonylation sites by employing sequence and predicted structural features. Evolutionary information and physicochemical properties are found to be the two most discriminative features whereas a structural feature called half-sphere exposure provides additional improvement to the prediction performance. SPRINT-Mal trained on mouse data yields robust performance for 10-fold cross validation and independent test set with Area Under the Curve (AUC) values of 0.74 and 0.76 and Matthews' Correlation Coefficient (MCC) of 0.213 and 0.20, respectively. Moreover, SPRINT-Mal achieved comparable performance when testing on H. sapiens proteins without species-specific training but not in bacterium S. erythraea. This suggests similar underlying physicochemical mechanisms between mouse and human but not between mouse and bacterium. SPRINT-Mal is freely available as an online server at: http://sparks-lab.org/server/SPRINT-Mal/. © 2018 Wiley Periodicals, Inc.

Keywords: lysine-malonylation sites prediction; post translational modification; support vector machines.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Bacterial Proteins / chemistry*
  • Bacterial Proteins / metabolism
  • Hominidae / metabolism
  • Humans
  • Lysine / chemistry*
  • Lysine / metabolism
  • Machine Learning*
  • Malonates / chemistry*
  • Malonates / metabolism
  • Mice
  • Molecular Structure
  • Protein Processing, Post-Translational
  • Saccharopolyspora / chemistry
  • Saccharopolyspora / metabolism

Substances

  • Bacterial Proteins
  • Malonates
  • Lysine