Evaluation of structural similarity based on reduced dimensionality representations of protein structure

Protein Eng Des Sel. 2004 May;17(5):425-32. doi: 10.1093/protein/gzh049. Epub 2004 Jun 8.

Abstract

Protein similarity estimations can be achieved using reduced dimensional representations and we describe a new application for the generation of two-dimensional maps from the three-dimensional structure. The code for the dimensionality reduction is based on the concept of pseudo-random generation of two-dimensional coordinates and Monte Carlo-like acceptance criteria for the generated coordinates. A new method for calculating protein similarity is developed by introducing a distance-dependent similarity field. Similarity of two proteins is derived from similarity field indices between amino acids based on various criteria such as hydrophobicity, residue replacement factors and conformational similarity, each showing a one factor Gaussian dependence. Results on comparisons of misfolded protein models with data sets of correctly folded structures show that discrimination between correctly folded and misfolded structures is possible. Tests were carried out on five different proteins, comparing a misfolded protein structure with members of the same topology, architecture, family and domain according to the CATH classification.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Computational Biology
  • Data Interpretation, Statistical
  • Magnetic Resonance Spectroscopy
  • Models, Molecular*
  • Monte Carlo Method
  • Protein Structure, Tertiary
  • Proteins / chemistry*
  • Structural Homology, Protein*

Substances

  • Proteins