High-throughput sequencing for the identification of binding molecules from DNA-encoded chemical libraries

Bioorg Med Chem Lett. 2010 Jul 15;20(14):4188-92. doi: 10.1016/j.bmcl.2010.05.053. Epub 2010 May 20.

Abstract

DNA-encoded chemical libraries are large collections of small organic molecules, individually coupled to DNA fragments that serve as amplifiable identification bar codes. The isolation of specific binders requires a quantitative analysis of the distribution of DNA fragments in the library before and after capture on an immobilized target protein of interest. Here, we show how Illumina sequencing can be applied to the analysis of DNA-encoded chemical libraries, yielding over 10 million DNA sequence tags per flow-lane. The technology can be used in a multiplex format, allowing the encoding and subsequent sequencing of multiple selections in the same experiment. The sequence distributions in DNA-encoded chemical library selections were found to be similar to the ones obtained using 454 technology, thus reinforcing the concept that DNA sequencing is an appropriate avenue for the decoding of library selections. The large number of sequences obtained with the Illumina method now enables the study of very large DNA-encoded chemical libraries (>500,000 compounds) and reduces decoding costs.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Combinatorial Chemistry Techniques*
  • DNA / chemistry*

Substances

  • DNA