Robustness of biological activity spectra predicting by computer program PASS for noncongeneric sets of chemical compounds

J Chem Inf Comput Sci. 2000 Nov-Dec;40(6):1349-55. doi: 10.1021/ci000383k.

Abstract

The computer system PASS provides simultaneous prediction of several hundreds of biological activity types for any drug-like compound. The prediction is based on the analysis of structure-activity relationships of the training set including more than 30000 known biologically active compounds. In this paper we investigate the influence on the accuracy of predicting the types of activity with PASS by (a) reduction of the number of structures in the training set and (b) reduction of the number of known activities in the training set. The compounds from the MDDR database are used to create heterogeneous training and evaluation sets. We demonstrate that predictions are robust despite the exclusion of up to 60% of information.

MeSH terms

  • Chemistry, Pharmaceutical / methods*
  • Databases, Factual
  • Drug Evaluation, Preclinical
  • Software*
  • Structure-Activity Relationship