Biological and functional relevance of CASP predictions

Proteins. 2018 Mar;86 Suppl 1(Suppl Suppl 1):374-386. doi: 10.1002/prot.25396. Epub 2017 Oct 17.

Abstract

Our goal is to answer the question: compared with experimental structures, how useful are predicted models for functional annotation? We assessed the functional utility of predicted models by comparing the performances of a suite of methods for functional characterization on the predictions and the experimental structures. We identified 28 sites in 25 protein targets to perform functional assessment. These 28 sites included nine sites with known ligand binding (holo-sites), nine sites that are expected or suggested by experimental authors for small molecule binding (apo-sites), and Ten sites containing important motifs, loops, or key residues with important disease-associated mutations. We evaluated the utility of the predictions by comparing their microenvironments to the experimental structures. Overall structural quality correlates with functional utility. However, the best-ranked predictions (global) may not have the best functional quality (local). Our assessment provides an ability to discriminate between predictions with high structural quality. When assessing ligand-binding sites, most prediction methods have higher performance on apo-sites than holo-sites. Some servers show consistently high performance for certain types of functional sites. Finally, many functional sites are associated with protein-protein interaction. We also analyzed biologically relevant features from the protein assemblies of two targets where the active site spanned the protein-protein interface. For the assembly targets, we find that the features in the models are mainly determined by the choice of template.

Keywords: CASP; binding site; functional relevance; microenvironment; protein structure prediction.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Binding Sites
  • Biological Products / metabolism*
  • Catalytic Domain
  • Computational Biology / methods*
  • Humans
  • Ligands
  • Models, Molecular*
  • Models, Statistical*
  • Protein Binding
  • Protein Conformation*
  • Proteins / chemistry*
  • Proteins / metabolism*

Substances

  • Biological Products
  • Ligands
  • Proteins