A large-scale comparison of computational models on the residue flexibility for NMR-derived proteins

Protein Pept Lett. 2012 Feb;19(2):244-51. doi: 10.2174/092986612799080301.

Abstract

As an alternative to X-ray crystallography, nuclear magnetic resonance (NMR) has also emerged as the method of choice for studying both protein structure and dynamics in solution. However, little work using computational models such as Gaussian network model (GNM) and machine learning approaches has focused on NMR-derived proteins to predict the residue flexibility, which is represented by the root mean square deviation (RMSD) with respect to the average structure. We provide a large-scale comparison of computational models, including GNM, parameter-free GNM and several linear regression models using local solvent exposures as inputs, based on a dataset of 1609 protein chains whose structures were resolved by NMR. The result again confirmed that the correlation of GNM outputs with raw RMSD values was better than that using B-factors of X-ray data. Nevertheless, it was also concluded that the parameter-free GNM and the solvent exposure based linear regression models performed worse than GNM when predicting RMSD, contrary to results using X-ray data. The discrepancy of residue flexibility prediction between NMR and X-ray data is likely attributable to a combination of their physical and methodological differences.

Publication types

  • Comparative Study
  • Evaluation Study
  • Research Support, Non-U.S. Gov't
  • Validation Study

MeSH terms

  • Animals
  • Computational Biology / methods
  • Computer Simulation*
  • Crystallography, X-Ray
  • Databases, Protein
  • High-Throughput Screening Assays / methods
  • Humans
  • Models, Molecular
  • Normal Distribution
  • Nuclear Magnetic Resonance, Biomolecular*
  • Protein Conformation
  • Protein Folding*
  • Protein Interaction Domains and Motifs / physiology*
  • Proteins / chemistry*
  • Proteins / metabolism
  • Sequence Analysis, Protein / methods

Substances

  • Proteins