Prediction of Disordered Regions in Proteins with Recurrent Neural Networks and Protein Dynamics

J Mol Biol. 2022 Jun 30;434(12):167579. doi: 10.1016/j.jmb.2022.167579. Epub 2022 Apr 22.

Abstract

The role of intrinsically disordered protein regions (IDRs) in cellular processes has become increasingly evident over the last years. These IDRs continue to challenge structural biology experiments because they lack a well-defined conformation, and bioinformatics approaches that accurately delineate disordered protein regions remain essential for their identification and further investigation. Typically, these predictors use the protein amino acid sequence, without taking into account likely sequence-dependent emergent properties, such as protein backbone dynamics. Here we present DisoMine, a method that predicts protein'long disorder' with recurrent neural networks from simple predictions of protein dynamics, secondary structure and early folding. The tool is fast and requires only a single sequence, making it applicable for large-scale screening, including poorly studied and orphan proteins. DisoMine is a top performer in its category and compares well to disorder prediction approaches using evolutionary information. DisoMine is freely available through an interactive webserver at https://bio2byte.be/disomine/.

Keywords: Intrinsically disordered proteins; Machine learning; Neural networks; Protein backbone dynamics; Protein disorder prediction.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Computational Biology / methods
  • Intrinsically Disordered Proteins* / chemistry
  • Neural Networks, Computer*
  • Protein Structure, Secondary
  • Sequence Analysis, Protein* / methods
  • Software*

Substances

  • Intrinsically Disordered Proteins