The Origin of Discrepancies between Predictions and Annotations in Intrinsically Disordered Proteins

Biomolecules. 2023 Sep 25;13(10):1442. doi: 10.3390/biom13101442.

Abstract

Disorder prediction methods that can discriminate between ordered and disordered regions have contributed fundamentally to our understanding of the properties and prevalence of intrinsically disordered proteins (IDPs) in proteomes as well as their functional roles. However, a recent large-scale assessment of the performance of these methods indicated that there is still room for further improvements, necessitating novel approaches to understand the strengths and weaknesses of individual methods. In this study, we compared two methods, IUPred and disorder prediction, based on the pLDDT scores derived from AlphaFold2 (AF2) models. We evaluated these methods using a dataset from the DisProt database, consisting of experimentally characterized disordered regions and subsets associated with diverse experimental methods and functions. IUPred and AF2 provided consistent predictions in 79% of cases for long disordered regions; however, for 15% of these cases, they both suggested order in disagreement with annotations. These discrepancies arose primarily due to weak experimental support, the presence of intermediate states, or context-dependent behavior, such as binding-induced transitions. Furthermore, AF2 tended to predict helical regions with high pLDDT scores within disordered segments, while IUPred had limitations in identifying linker regions. These results provide valuable insights into the inherent limitations and potential biases of disorder prediction methods.

Keywords: AlphaFold2; DisProt; IUPred; disorder prediction methods; disorder to order transition; flexible linker; intrinsically disordered proteins; intrinsically disordered regions; molten globule.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Databases, Factual
  • Furylfuramide
  • Intrinsically Disordered Proteins* / metabolism
  • Protein Conformation
  • Proteome / metabolism

Substances

  • Intrinsically Disordered Proteins
  • Furylfuramide
  • Proteome

Grants and funding

This publication is part of a project that has received funding from the European Union’s Horizon 2020 research and innovation program under grant agreement No. 778247. This work was supported by ELIXIR, the research infrastructure for life-science data.