Systematic analysis of large screening sets in drug discovery

Paul E Blower Jr; Kevin P Cross; Michael A Fligner; Glenn J Myatt; Joseph S Verducci; Chihae Yang

doi:10.2174/1570163043484879

Systematic analysis of large screening sets in drug discovery

Curr Drug Discov Technol. 2004 Jan;1(1):37-47. doi: 10.2174/1570163043484879.

Authors

Paul E Blower Jr¹, Kevin P Cross, Michael A Fligner, Glenn J Myatt, Joseph S Verducci, Chihae Yang

Affiliation

¹ Leadscope, Inc., 1275 Kinnear Road, Columbus, OH 43212, USA. pblower@leadscope.com

PMID: 16472218
DOI: 10.2174/1570163043484879

Abstract

Each year large pharmaceutical companies produce massive amounts of primary screening data for lead discovery. To make better use of the vast amount of information in pharmaceutical databases, companies have begun to scrutinize the lead generation stage to ensure that more and better qualified lead series enter the downstream optimization and development stages. This article describes computational techniques for end to end analysis of large drug discovery screening sets. The analysis proceeds in three stages: In stage 1 the initial screening set is filtered to remove compounds that are unsuitable as lead compounds. In stage 2 local structural neighborhoods around active compound classes are identified, including similar but inactive compounds. In stage 3 the structure-activity relationships within local structural neighborhoods are analyzed. These processes are illustrated by analyzing two large, publicly available databases.

MeSH terms

Algorithms
Data Interpretation, Statistical
Databases, Factual
Drug Design*
Drug Evaluation, Preclinical*
Pharmaceutical Preparations / classification
Pharmacology / trends*
Structure-Activity Relationship
Terminology as Topic

Substances

Pharmaceutical Preparations