A Computational Software for Training Robust Drug-Target Affinity Prediction Models: pydebiaseddta

Melih Barsbey; Riza Özçelik; Alperen Bağ; Berk Atil; Arzucan Özgür; Elif Ozkirimli

doi:10.1089/cmb.2023.0194

J Comput Biol. 2023 Nov;30(11):1240-1245. doi: 10.1089/cmb.2023.0194.

Authors

Melih Barsbey¹, Riza Özçelik¹, Alperen Bağ², Berk Atil¹, Arzucan Özgür¹, Elif Ozkirimli³

Affiliations

¹ Department of Computer Engineering, Boğaziçi University, İstanbul, Turkey.
² Technical University of Munich, Munich, Germany.
³ Roche Informatics, F. Hoffmann-La Roche AG, Basel, Switzerland.

PMID: 37988394
DOI: 10.1089/cmb.2023.0194

Abstract

Robust generalization of drug-target affinity (DTA) prediction models is a notoriously difficult problem in computational drug discovery. In this article, we present pydebiaseddta, a software package for improving the generalizability of DTA prediction models to novel ligands and/or proteins. pydebiaseddta is the practical implementation of the DebiasedDTA training framework, which advocates modifying the training distribution to mitigate the effect of spurious correlations in the training data set; such correlations lead to substantially degraded performance on novel ligands and proteins. Written in the Python programming language, pydebiaseddta combines a streamlined, user-friendly interface with a feature-rich and highly modifiable architecture. We introduce the software, showcase its main functionalities, and describe practical ways for new users to engage with it.
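
The abstract does not spell out how the training distribution is modified, but the keywords mention importance weighting. The sketch below illustrates that general idea only. It is not the pydebiaseddta API, and every name in it (guide_errors, importance_weights, weighted_mse) is hypothetical. It assumes a trivial "guide" that always predicts the mean affinity, and it gives more weight to samples the guide predicts poorly, on the assumption that such samples are less explained by dataset shortcuts.

```python
from statistics import mean


def guide_errors(affinities):
    """Squared error of a trivial guide that always predicts the mean affinity.

    Assumption: a real guide would be a weak model that can only exploit
    dataset biases; the mean predictor is a stand-in for illustration.
    """
    mu = mean(affinities)
    return [(y - mu) ** 2 for y in affinities]


def importance_weights(errors, eps=1e-8):
    """Turn guide errors into per-sample weights that average to 1.

    Samples with larger guide error receive larger weights. eps keeps
    the weights defined when every error is zero.
    """
    n = len(errors)
    total = sum(errors) + eps * n
    return [n * (e + eps) / total for e in errors]


def weighted_mse(predictions, targets, weights):
    """Mean squared error in which each sample counts in proportion to its weight."""
    weighted_sum = sum(
        w * (p - t) ** 2 for p, t, w in zip(predictions, targets, weights)
    )
    return weighted_sum / sum(weights)


# Toy affinities: three similar samples and one the guide predicts poorly.
affinities = [5.0, 5.0, 5.0, 9.0]
weights = importance_weights(guide_errors(affinities))

# Loss of a hypothetical predictor under the reweighted distribution.
predictions = [5.0, 5.0, 5.0, 8.0]
loss = weighted_mse(predictions, affinities, weights)
```

In this toy case the fourth sample receives the largest weight, so an error on it dominates the loss. A predictor trained against such a loss is pushed to fit the samples that the guide's shortcut cannot explain.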

Keywords: computational drug discovery; drug–target affinity; importance weighting; out-of-distribution generalization; spurious correlation; virtual drug screening.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Drug Discovery
Programming Languages*
Proteins
Software*

Substances

Proteins