The efficiency of ligand-receptor interaction information alone as new descriptors in QSAR modeling via random forest artificial neural network

Chem Biol Drug Des. 2020 Aug;96(2):812-824. doi: 10.1111/cbdd.13690. Epub 2020 May 5.

Abstract

A new approach is introduced for the construction of a predictive quantitative structure-activity relationship model in which only ligand-receptor (LR) interaction features are used as relevant descriptors. This approach combines the benefit of the random forest (RF) as a new variable selection method with the intrinsic capability of the artificial neural network (ANN). The interaction information of the ligand-receptor (LR) complex was used as molecular docking descriptors. The most relevant descriptors were selected using the RF technique and used as inputs of ANN. The proposed RF ANN (RF-LM-ANN) method was optimized and then evaluated by the prediction of pEC50 for some of the azine derivatives as non-nucleoside reverse transcriptase inhibitors. RF-LM-ANN model under the optimal conditions was evaluated using internal (validation) and external test sets. The determination coefficients of the external test and validation sets were 0.88 and 0.89, respectively. The mean square deviation (MSE) values for the prediction of biological activities in the external test and validation sets were found to be 0.10 and 0.11, respectively. The results obtained demonstrated the good prediction ability and high generalizability of the proposed RF-LM-ANN model based on the MMDs alone.

Keywords: HIV; QSAR; artificial neural network; molecular docking; random forest.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • DNA-Directed RNA Polymerases / antagonists & inhibitors*
  • Enzyme Inhibitors / chemistry*
  • Heterocyclic Compounds / chemistry*
  • Ligands
  • Molecular Docking Simulation
  • Neural Networks, Computer
  • Protein Binding
  • Quantitative Structure-Activity Relationship

Substances

  • Enzyme Inhibitors
  • Heterocyclic Compounds
  • Ligands
  • DNA-Directed RNA Polymerases