Misprediction of Structural Disorder in Halophiles

Molecules. 2019 Jan 29;24(3):479. doi: 10.3390/molecules24030479.

Abstract

Whereas the concept of intrinsic disorder derives from biophysical observations of the lack of structure of proteins or protein regions under native conditions, many of our respective concepts rest on proteome-scale bioinformatics predictions. It is established that most predictors work reliably on proteins commonly encountered, but it is often neglected that we know very little about their performance on proteins of microorganisms that thrive in environments of extreme temperature, pH, or salt concentration, which may cause adaptive sequence composition bias. To address this issue, we predicted structural disorder for the complete proteomes of different extremophile groups by popular prediction methods and compared them to those of the reference mesophilic group. While significant deviations from mesophiles could be explained by a lack or gain of disordered regions in hyperthermophiles and radiotolerants, respectively, we found systematic overprediction in the case of halophiles. Additionally, examples were collected from the Protein Data Bank (PDB) to demonstrate misprediction and to help understand the underlying biophysical principles, i.e., halophilic proteins maintain a highly acidic and hydrophilic surface to avoid aggregation in high salt conditions. Although sparseness of data on disordered proteins from extremophiles precludes the development of dedicated general predictors, we do formulate recommendations for how to address their disorder with current bioinformatics tools.

Keywords: adaptation; charge distribution; disorder prediction; extremophile; halophile; intrinsically disordered; sequence composition bias; structurally disordered.

MeSH terms

  • Computational Biology / methods
  • Databases, Protein / statistics & numerical data
  • Models, Molecular
  • Protein Conformation
  • Proteins / chemistry*
  • Proteome / analysis

Substances

  • Proteins
  • Proteome