Inactive-enriched machine-learning models exploiting patent data improve structure-based virtual screening for PDL1 dimerizers

J Adv Res. 2024 Jan 26:S2090-1232(24)00037-7. doi: 10.1016/j.jare.2024.01.024. Online ahead of print.

Abstract

Introduction: Small-molecule Programmable Cell Death Protein 1/Programmable Death-Ligand 1 (PD1/PDL1) inhibition via PDL1 dimerization has the potential to lead to inexpensive drugs with better cancer patient outcomes and milder side effects. However, this therapeutic approach has proven challenging, with only one PDL1 dimerizer reaching early clinical trials so far. There is hence a need for fast and accurate methods to develop alternative PDL1 dimerizers.

Objectives: We aim to show that structure-based virtual screening (SBVS) based on PDL1-specific machine-learning (ML) scoring functions (SFs) is a powerful drug design tool for detecting PD1/PDL1 inhibitors via PDL1 dimerization.

Methods: By incorporating the latest MLSF advances, we generated and evaluated PDL1-specific MLSFs (classifiers and inactive-enriched regressors) on two demanding test sets.

Results: 60 PDL1-specific MLSFs (30 classifiers and 30 regressors) were generated. Our large-scale analysis provides highly predictive PDL1-specific MLSFs that benefitted from training with large volumes of docked inactives and enabling inactive-enriched regression.

Conclusion: PDL1-specific MLSFs strongly outperformed generic SFs of various types on this target and are released here without restrictions.

Keywords: Artificial intelligence; Docking; Immunotherapy; Machine learning; PD1; PDL1; Virtual screening.