Making every SAR point count: the development of Chemistry Connect for the large-scale integration of structure and bioactivity data

Drug Discov Today. 2011 Dec;16(23-24):1019-30. doi: 10.1016/j.drudis.2011.10.005. Epub 2011 Oct 14.

Abstract

The increase in drug research output from patent applications, together with the expansion of public data collections, such as ChEMBL and PubChem BioAssay, has made it essential for pharmaceutical companies to integrate both internal and external 'SAR estate'. The AstraZeneca response has been the development of an enterprise application, Chemistry Connect, containing 45 million unique chemical structures from 18 internal and external data sources. It includes merged compound-to-assay-to-result-to-target relationships extracted from patents, papers and internal data. Users can explore connections between these by searching using drug names or synonyms, chemical structures, patent numbers and target protein identifiers at a scale not previously available.

Publication types

  • Review

MeSH terms

  • Computational Biology / methods
  • Databases, Factual*
  • Drug Discovery / methods*
  • Humans
  • Pharmaceutical Preparations / chemistry*
  • Pharmacology*
  • Structure-Activity Relationship

Substances

  • Pharmaceutical Preparations