A novel search engine for virtual screening of very large databases

J Chem Inf Model. 2006 Mar-Apr;46(2):836-43. doi: 10.1021/ci050458q.

Abstract

Virtual screening of large chemical databases using the structure of the receptor can be computationally very demanding. We present a novel strategy that combines exhaustive similarity searches directly in SMILES format with the docking of flexible ligands, whose 3D structure is generated on the fly from the SMILES representation. Our strategy makes use of the recently developed LINGO tools to extract implicit chemical information from SMILES strings and integrates LINGO similarities into a pseudo-evolutionary algorithm. The algorithm represents a combination of a fast target-independent similarity method with a slower but information richer target-focused method. A virtual search of FactorXa ligands provided 62% of the potential hits after docking only 6.5% of a database of nearly 1 million molecules. The set of solutions showed good diversity, indicating that the method shows good scaffold hopping capabilities.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Chemical Phenomena
  • Chemistry
  • Databases as Topic*
  • Drug Design*
  • Drug Evaluation, Preclinical / methods*
  • Factor Xa / chemistry*
  • Factor Xa / metabolism
  • Ligands*
  • Software*

Substances

  • Ligands
  • Factor Xa