GPS-Uber: a hybrid-learning framework for prediction of general and E3-specific lysine ubiquitination sites

Brief Bioinform. 2022 Mar 10;23(2):bbab574. doi: 10.1093/bib/bbab574.

Abstract

As an important post-translational modification, lysine ubiquitination participates in numerous biological processes and is involved in human diseases, whereas the site specificity of ubiquitination is mainly decided by ubiquitin-protein ligases (E3s). Although numerous ubiquitination predictors have been developed, computational prediction of E3-specific ubiquitination sites is still a great challenge. Here, we carefully reviewed the existing tools for the prediction of general ubiquitination sites. Also, we developed a tool named GPS-Uber for the prediction of general and E3-specific ubiquitination sites. From the literature, we manually collected 1311 experimentally identified site-specific E3-substrate relations, which were classified into different clusters based on corresponding E3s at different levels. To predict general ubiquitination sites, we integrated 10 types of sequence and structure features, as well as three types of algorithms including penalized logistic regression, deep neural network and convolutional neural network. Compared with other existing tools, the general model in GPS-Uber exhibited a highly competitive accuracy, with an area under curve values of 0.7649. Then, transfer learning was adopted for each E3 cluster to construct E3-specific models, and in total 112 individual E3-specific predictors were implemented. Using GPS-Uber, we conducted a systematic prediction of human cancer-associated ubiquitination events, which could be helpful for further experimental consideration. GPS-Uber will be regularly updated, and its online service is free for academic research at http://gpsuber.biocuckoo.cn/.

Keywords: Post-translational modification; deep learning; lysine ubiquitination; site-specific E3-substrate relation; ubiquitin-protein ligase.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Humans
  • Lysine* / metabolism
  • Protein Processing, Post-Translational
  • Ubiquitin-Protein Ligases* / chemistry
  • Ubiquitin-Protein Ligases* / genetics
  • Ubiquitin-Protein Ligases* / metabolism
  • Ubiquitination

Substances

  • Ubiquitin-Protein Ligases
  • Lysine