Cell Painting-based bioactivity prediction boosts high-throughput screening hit-rates and compound diversity

Nat Commun. 2024 Apr 24;15(1):3470. doi: 10.1038/s41467-024-47171-1.

Abstract

Identifying active compounds for a target is a time- and resource-intensive task in early drug discovery. Accurate bioactivity prediction using morphological profiles could streamline the process, enabling smaller, more focused compound screens. We investigate the potential of deep learning on unrefined single-concentration activity readouts and Cell Painting data, to predict compound activity across 140 diverse assays. We observe an average ROC-AUC of 0.744 ± 0.108 with 62% of assays achieving ≥0.7, 30% ≥0.8, and 7% ≥0.9. In many cases, the high prediction performance can be achieved using only brightfield images instead of multichannel fluorescence images. A comprehensive analysis shows that Cell Painting-based bioactivity prediction is robust across assay types, technologies, and target classes, with cell-based assays and kinase targets being particularly well-suited for prediction. Experimental validation confirms the enrichment of active compounds. Our findings indicate that models trained on Cell Painting data, combined with a small set of single-concentration data points, can reliably predict the activity of a compound library across diverse targets and assays while maintaining high hit rates and scaffold diversity. This approach has the potential to reduce the size of screening campaigns, saving time and resources, and enabling primary screening with more complex assays.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Deep Learning
  • Drug Discovery* / methods
  • High-Throughput Screening Assays* / methods
  • Humans
  • Small Molecule Libraries / chemistry
  • Small Molecule Libraries / pharmacology

Substances

  • Small Molecule Libraries