ProteinDBS: a real-time retrieval system for protein structure comparison

Nucleic Acids Res. 2004 Jul 1;32(Web Server issue):W572-5. doi: 10.1093/nar/gkh436.

Abstract

We have developed a web server (ProteinDBS) for the life science community to search for similar protein tertiary structures in real time. The system applies computer visualization techniques to extract the predominant visual patterns encoded in two-dimensional distance matrices generated from the three-dimensional coordinates of protein chains. Once meaningful content, represented in a multi-dimensional feature space, has been extracted from the distance matrices, an advanced indexing structure, the Entropy Balanced Statistical (EBS) k-d tree, is used to index the data. The system returns search results in ranked order from a database of 46 075 chains in seconds, with a reasonably high degree of precision. To our knowledge, this is the first real-time search engine for protein structure comparison. ProteinDBS provides two query methods: query by Protein Data Bank protein chain ID and query by a new structure uploaded by the user. The system is hosted at http://ProteinDBS.rnet.missouri.edu.
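The pipeline outlined in the abstract (coordinates, then a distance matrix, then a feature vector, then a ranked search) can be illustrated with a minimal Python sketch. Only the distance-matrix step follows directly from the text. The two-value feature summary (`toy_features`), the linear-scan ranking (`rank_chains`), the chain names and the coordinates are all hypothetical stand-ins chosen for illustration; they are not the features extracted by ProteinDBS, and the linear scan does not reproduce the EBS k-d tree.

```python
import math

def distance_matrix(coords):
    """Pairwise Euclidean distances between 3D points (e.g. one point per residue)."""
    return [[math.dist(a, b) for b in coords] for a in coords]

def toy_features(dm):
    """Hypothetical feature vector: mean and standard deviation of the
    off-diagonal distances. An assumption for illustration only, not the
    visual-pattern features used by ProteinDBS."""
    n = len(dm)
    vals = [dm[i][j] for i in range(n) for j in range(i + 1, n)]
    mean = sum(vals) / len(vals)
    var = sum((v - mean) ** 2 for v in vals) / len(vals)
    return (mean, math.sqrt(var))

def rank_chains(query_feat, database):
    """Rank chain IDs by Euclidean distance in feature space.
    A brute-force linear scan, standing in for an index such as a k-d tree."""
    return sorted(database, key=lambda cid: math.dist(query_feat, database[cid]))

# Made-up coordinates for two hypothetical chains (not real PDB data).
chains = {
    "chainA": [(0.0, 0.0, 0.0), (3.0, 0.0, 0.0), (3.0, 4.0, 0.0)],
    "chainB": [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0)],
}
database = {cid: toy_features(distance_matrix(c)) for cid, c in chains.items()}

# The query is chainA translated by (1, 1, 1); its distance matrix is unchanged.
query = [(1.0, 1.0, 1.0), (4.0, 1.0, 1.0), (4.0, 5.0, 1.0)]
ranking = rank_chains(toy_features(distance_matrix(query)), database)
print(ranking)
```

Because a distance matrix depends only on inter-point distances, it is unchanged by translating or rotating the structure, which is why the shifted query still ranks `chainA` first in this toy example.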

MeSH terms

  • Computer Graphics
  • Databases, Protein
  • Internet
  • Models, Molecular
  • Protein Structure, Tertiary*
  • Reproducibility of Results
  • Software*
  • Time Factors
  • User-Computer Interface