Similarity-based descriptors (SIBAR)--a tool for safe exchange of chemical information?

J Comput Aided Mol Des. 2005 Sep-Oct;19(9-10):687-92. doi: 10.1007/s10822-005-9000-8. Epub 2005 Oct 26.

Abstract

Exchange of chemical information without disclosure of the respective structures would greatly increase the data sets available for model building. Within the framework of the ChemMask project, we explored whether SIBAR descriptors can, in principle, be used to mask chemical structures. SIBAR is based on calculating similarity values between each compound of the training set and a set of reference compounds. Although the SIBAR approach per se does not allow the chemical structure of a compound to be traced back unambiguously, similarity searching in a 1.5 million compound database spiked with compounds structurally analogous to the query structure led to the retrieval of compounds structurally and pharmacologically highly analogous to the "hidden" query structure in all three examples investigated. Comparison with results obtained using the original descriptors from which the SIBAR values were calculated showed that SIBAR does add some fuzziness to the data matrix.
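The idea of describing a compound by its similarities to a reference set, and then searching a database with those values alone, can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the abstract: fingerprints are modelled as sets of "on" bit indices, Tanimoto similarity stands in for whatever similarity measure the authors used, Euclidean distance ranks the database, and all compound names and fingerprints are invented toy data.

```python
import math


def tanimoto(a, b):
    """Tanimoto similarity between two sets of 'on' fingerprint bits.

    Assumption: the abstract does not name the similarity measure.
    """
    union = len(a | b)
    return len(a & b) / union if union else 0.0


def sibar_vector(fingerprint, reference_fps):
    """One similarity value per reference compound (hypothetical helper)."""
    return [tanimoto(fingerprint, ref) for ref in reference_fps]


# Hypothetical reference set shared by both parties.
reference_fps = [
    frozenset({1, 2, 3, 4}),
    frozenset({3, 4, 5, 6}),
    frozenset({7, 8}),
]

# The "hidden" query: only its similarity vector would be exchanged.
hidden_fp = frozenset({1, 2, 3, 5})
query_vector = sibar_vector(hidden_fp, reference_fps)

# Hypothetical database, spiked with one structural analog of the query.
database = {
    "analog": frozenset({1, 2, 3, 6}),
    "unrelated": frozenset({7, 8, 9}),
}

# Rank database compounds by distance between similarity vectors.
ranked = sorted(
    database,
    key=lambda name: math.dist(
        query_vector, sibar_vector(database[name], reference_fps)
    ),
)

print(query_vector)
print(ranked)
```

In this toy data the analog ranks first even though its fingerprint differs from the query's, which mirrors the trade-off described above: the similarity vector does not pin down a single structure, yet it can still lead a search to close analogs.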

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Computer Simulation*
  • Databases, Factual
  • Drug Design
  • Models, Chemical*
  • Molecular Structure