Optimal decision-making in high-throughput virtual screening pipelines

Patterns (N Y). 2023 Nov 3;4(11):100875. doi: 10.1016/j.patter.2023.100875. eCollection 2023 Nov 10.

Abstract

Efficient computational screening of molecular candidates for desired properties is needed in many scientific and engineering problems, including drug discovery and materials design. However, the enormous candidate search space and the substantial computational cost of high-fidelity property prediction models make such screening challenging in practice. In this work, we propose a general framework for constructing and optimizing a high-throughput virtual screening (HTVS) pipeline composed of multi-fidelity models. The central idea is to allocate computational resources optimally across models of varying cost and accuracy so as to maximize the return on computational investment (ROCI). Using both simulated and real-world data, we demonstrate that the proposed optimal HTVS framework can significantly accelerate virtual screening without degrading accuracy. Furthermore, it enables an adaptive operational strategy for HTVS in which accuracy can be traded for efficiency.
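
To make the idea of a multi-fidelity screening pipeline concrete, the following is a minimal toy sketch, not the framework proposed in the paper. It assumes a two-stage cascade in which a cheap, noisy surrogate filters candidates before an expensive, accurate model is applied. The function `run_cascade`, the synthetic property values, the per-candidate costs (1 and 100 units), and the thresholds are all hypothetical choices made for illustration. The thresholds here are fixed by hand, whereas the abstract describes optimizing the allocation.

```python
import random


def run_cascade(candidates, stages):
    """Pass candidates through screening stages, cheapest first.

    Each stage is a tuple (score_fn, cost_per_candidate, threshold).
    A candidate survives a stage only if its score is >= threshold.
    Returns the surviving candidates and the total cost spent.
    """
    total_cost = 0.0
    survivors = list(candidates)
    for score_fn, cost, threshold in stages:
        # Every candidate that reaches this stage must be evaluated by it.
        total_cost += cost * len(survivors)
        survivors = [c for c in survivors if score_fn(c) >= threshold]
    return survivors, total_cost


# Synthetic data (assumption): a "true" property per candidate, plus a
# noisy low-fidelity estimate of the same property.
rng = random.Random(0)
true_values = [rng.gauss(0.0, 1.0) for _ in range(1000)]
noisy_values = [v + rng.gauss(0.0, 0.5) for v in true_values]
candidates = list(range(len(true_values)))


def cheap_model(i):
    """Low-fidelity surrogate: inexpensive but noisy."""
    return noisy_values[i]


def expensive_model(i):
    """High-fidelity model: treated as exact in this toy setting."""
    return true_values[i]


TARGET = 1.0        # a candidate is a "hit" if its property is >= TARGET
PRE_FILTER = 0.0    # hand-picked threshold for the cheap stage

# Baseline: run the expensive model on every candidate.
baseline_hits, baseline_cost = run_cascade(
    candidates, [(expensive_model, 100.0, TARGET)]
)

# Cascade: discard unpromising candidates with the cheap model first.
cascade_hits, cascade_cost = run_cascade(
    candidates,
    [(cheap_model, 1.0, PRE_FILTER), (expensive_model, 100.0, TARGET)],
)

recall = len(cascade_hits) / len(baseline_hits)
print(f"baseline: {len(baseline_hits)} hits at cost {baseline_cost:.0f}")
print(f"cascade:  {len(cascade_hits)} hits at cost {cascade_cost:.0f}")
print(f"fraction of baseline hits recovered: {recall:.3f}")
```

In this sketch, raising `PRE_FILTER` lowers the total cost but risks discarding true hits whose cheap score was underestimated, which is one simple way to picture the accuracy-for-efficiency trade-off mentioned in the abstract.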

Keywords: HTS; HTVS; ROCI; high-throughput screening; high-throughput virtual screening pipeline; optimal computational campaign; optimal decision-making; optimal screening; return on computational investment.