Development of machine learning models based on molecular fingerprints for selection of small molecule inhibitors against JAK2 protein

J Comput Chem. 2023 Jun 15;44(16):1493-1504. doi: 10.1002/jcc.27103. Epub 2023 Mar 16.

Abstract

Janus kinase 2 (JAK2) is emerging as a potential therapeutic target for many inflammatory diseases such as myeloproliferative disorders (MPD), cancer and rheumatoid arthritis (RA). In this study, we have collected experimental data of JAK2 protein containing 6021 unique inhibitors. We then characterized them based on Morgan (ECFP6) fingerprints followed by clustering into training and test set based on their molecular scaffolds. These data were used to build the classification models with various supervised machine learning (ML) algorithms that could prioritize novel inhibitors for future drug development against JAK2 protein. The best model built by Random Forest (RF) and Morgan fingerprints achieved the G-mean value of 0.84 on the external test set. As an application of our classification model, virtual screening was performed against Drugbank molecules in order to identify the potential inhibitors based on the confidence score by RF model. Nine potential molecules were identified, which were further subject to molecular docking studies to evaluate the virtual screening results of the best RF model. This proposed method can prove useful for developing novel target-specific JAK2 inhibitors.

Keywords: JAK2; Morgan fingerprints; machine learning; scaffolds; virtual screening.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Janus Kinase 2*
  • Machine Learning*
  • Molecular Docking Simulation

Substances

  • Janus Kinase 2