On the characterization of novel biologically active steroids: Selection of lipophilicity models of newly synthesized steroidal derivatives by classical and non-parametric ranking approaches

Comput Biol Chem. 2019 Jun:80:23-30. doi: 10.1016/j.compbiolchem.2019.03.001. Epub 2019 Mar 9.

Abstract

In this paper, the guidelines for the interpretation of the results of quantitative structure-retention relationship (QSRR) modeling, comparison and assessment of the established models, as well as the selection of the best and most consistent QSRR model were presented. Various linear and non-linear chemometric regression techniques were used to build QSRR models for chromatographic lipophilicity prediction of a series of triazole, tetrazole, toluenesulfonylhydrazide, nitrile, dinitrile and dione steroid derivatives. Linear regression (LR) and multiple linear regression (MLR) were used as linear techniques, while artificial neural networks (ANNs) were applied as non-linear modeling techniques. Generated models were statistically evaluated applying different approaches for model comparison and ranking. Two non-parametric methods (generalized pair correlation method - GPCM and sum of ranking differences - SRD) were used for model ranking and assessment of the best model for chromatographic lipophilicity prediction using experimentally obtained logk values and row average as a reference ranking. Both, GPCM and SRD, provided highly similar model choice regardless on a different background. These results are in agreement with the classical approach.

Keywords: Artificial neural networks; Generalized pair correlation method; Lipophilicity; Modeling; QSRR; Sum of ranking differences.

MeSH terms

  • Hydrophobic and Hydrophilic Interactions
  • Linear Models
  • Models, Chemical
  • Molecular Structure
  • Neural Networks, Computer
  • Quantitative Structure-Activity Relationship
  • Software
  • Steroids / chemistry*

Substances

  • Steroids