Computational prediction of disordered binding regions

Comput Struct Biotechnol J. 2023 Feb 10:21:1487-1497. doi: 10.1016/j.csbj.2023.02.018. eCollection 2023.

Abstract

One of the key features of intrinsically disordered regions (IDRs) is their ability to interact with a broad range of partner molecules. Multiple types of interacting IDRs were identified including molecular recognition fragments (MoRFs), short linear sequence motifs (SLiMs), and protein-, nucleic acids- and lipid-binding regions. Prediction of binding IDRs in protein sequences is gaining momentum in recent years. We survey 38 predictors of binding IDRs that target interactions with a diverse set of partners, such as peptides, proteins, RNA, DNA and lipids. We offer a historical perspective and highlight key events that fueled efforts to develop these methods. These tools rely on a diverse range of predictive architectures that include scoring functions, regular expressions, traditional and deep machine learning and meta-models. Recent efforts focus on the development of deep neural network-based architectures and extending coverage to RNA, DNA and lipid-binding IDRs. We analyze availability of these methods and show that providing implementations and webservers results in much higher rates of citations/use. We also make several recommendations to take advantage of modern deep network architectures, develop tools that bundle predictions of multiple and different types of binding IDRs, and work on algorithms that model structures of the resulting complexes.

Keywords: CAID, Critical Assessment of Intrinsic Disorder; CASP, Critical Assessment of techniques for protein Structure Prediction; DL, deep learning; Disordered binding regions; IDP, intrinsically disordered protein; IDR, intrinsically disordered region; Intrinsic disorder; ML, machine learning; MoRF, molecular recognition fragment; Molecular recognition features; NN, neural network; Protein-lipid interactions; Protein-nucleic acids interactions; Protein-protein interactions; SLiM, short linear sequence motif; Short linear motifs.

Publication types

  • Review