Cobdock: an accurate and practical machine learning-based consensus blind docking method

Sadettin Y Ugurlu; David McDonald; Huangshu Lei; Alan M Jones; Shu Li; Henry Y Tong; Mark S Butler; Shan He

doi:10.1186/s13321-023-00793-x

J Cheminform. 2024 Jan 11;16(1):5. doi: 10.1186/s13321-023-00793-x.

Authors

Sadettin Y Ugurlu¹, David McDonald², Huangshu Lei³, Alan M Jones⁴, Shu Li⁵, Henry Y Tong⁵, Mark S Butler², Shan He⁶ ⁷

Affiliations

¹ School of Computer Science, University of Birmingham, Edgbaston, Birmingham, B15 2TT, UK.
² AIA Insights Ltd, Birmingham, UK.
³ YaoPharma Co. Ltd., 100 Xingguang Avenue, Renhe Town, Yubei District, Chongqing, 401121, People's Republic of China.
⁴ School of Pharmacy, University of Birmingham, Edgbaston, Birmingham, B15 2TT, UK.
⁵ Centre for Artificial Intelligence Driven Drug Discovery, Macao Polytechnic University, R. de Luís Gonzaga Gomes, Macao, 5HV2+CP8, China.
⁶ School of Computer Science, University of Birmingham, Edgbaston, Birmingham, B15 2TT, UK. s.he@cs.bham.ac.uk.
⁷ AIA Insights Ltd, Birmingham, UK. s.he@cs.bham.ac.uk.

Abstract

Probing the surface of proteins to predict the binding site and binding affinity for a given small molecule is a critical but challenging task in drug discovery. Blind docking addresses this issue by performing docking on binding regions randomly sampled from the entire protein surface. However, compared with local docking, blind docking is less accurate and reliable because the docking space is too large to be sampled thoroughly. Cavity detection-guided blind docking methods improve accuracy by using cavity detection (also known as binding site detection) tools to guide the docking procedure. However, the performance of these methods relies heavily on the quality of the cavity detection tool. This dependence on a single cavity detection tool limits the overall performance of cavity detection-guided methods. To overcome this limitation, we propose Consensus Blind Dock (CoBDock), a novel blind, parallel docking method that uses machine learning algorithms to integrate docking and cavity detection results, improving both binding site identification and pose prediction accuracy. Our experiments on several datasets, including PDBBind 2020, ADS, MTi, DUD-E, and CASF-2016, showed that CoBDock has better binding site and binding mode performance than other state-of-the-art cavity detection tools and blind docking methods.

Keywords: Blind molecular docking; Consensus docking; Cross-docking; Docking; Global docking; Hybrid docking; Inverse-docking; Protein docking; Reverse-docking; Small molecule docking.

Grants and funding

64243970/150.02/10836324/Turkish Government PhD sponsorship