HeMoQuest: a webserver for qualitative prediction of transient heme binding to protein motifs

BMC Bioinformatics. 2020 Mar 27;21(1):124. doi: 10.1186/s12859-020-3420-2.

Abstract

Background: The notion of heme as a regulator of many physiological processes via transient binding to proteins is one that is recently being acknowledged. The broad spectrum of the effects of heme makes it important to identify further heme-regulated proteins to understand physiological and pathological processes. Moreover, several proteins were shown to be functionally regulated by interaction with heme, yet, for some of them the heme-binding site(s) remain unknown. The presented application HeMoQuest enables identification and qualitative evaluation of such heme-binding motifs from protein sequences.

Results: We present HeMoQuest, an online interface (http://bit.ly/hemoquest) to algorithms that provide the user with two distinct qualitative benefits. First, our implementation rapidly detects transient heme binding to nonapeptide motifs from protein sequences provided as input. Additionally, the potential of each predicted motif to bind heme is qualitatively gauged by assigning binding affinities predicted by an ensemble learning implementation, trained on experimentally determined binding affinity data. Extensive testing of our implementation on both existing and new manually curated datasets reveal that our method produces an unprecedented level of accuracy (92%) in identifying those residues assigned "heme binding" in all of the datasets used. Next, the machine learning implementation for the prediction and qualitative assignment of binding affinities to the predicted motifs achieved 71% accuracy on our data.

Conclusions: Heme plays a crucial role as a regulatory molecule exerting functional consequences via transient binding to surfaces of target proteins. HeMoQuest is designed to address this imperative need for a computational approach that enables rapid detection of heme-binding motifs from protein datasets. While most existing implementations attempt to predict sites of permanent heme binding, this application is to the best of our knowledge, the first of its kind to address the significance of predicting transient heme binding to proteins.

Keywords: Heme; Heme-binding site prediction; Heme-regulated protein; Machine learning; Transient heme binding; Web application.

MeSH terms

  • Algorithms
  • Amino Acid Motifs*
  • Binding Sites
  • Heme / metabolism*
  • Internet
  • Machine Learning
  • Protein Binding
  • Sequence Analysis, Protein
  • Software*

Substances

  • Heme