Highly diverse, massive organic data as explored by a composite QSPR strategy: an advanced study of boiling point

SAR QSAR Environ Res. 2005 Jun;16(3):231-46. doi: 10.1080/10659360500037115.

Abstract

An improved strategy of quantitative structure-property relationship (QSPR) studies of diverse and inhomogeneous organic datasets has been proposed. A molecular connectivity term was successively corrected for different structural features encoded in fragmental descriptors. The so-called solvation index 1chis (a weighted Randic index) was used as a "leading" variable and standardized molecular fragments were employed as "corrective" class-specific variables. Performance of the new approach was illustrated by modelling a dataset of experimental normal boiling points of 833 organic compounds belonging to 20 structural classes. Firstly, separate QSPR models were derived for each class and for eight groups of structurally similar classes. Finally, a general model formed by combining all the classes together was derived (r2=0.957, s=12.9degreesC). The strategy outlined can find application in QSPR analyses of massive, highly diverse databases of organic compounds.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Linear Models
  • Models, Chemical*
  • Organic Chemicals / chemistry*
  • Quantitative Structure-Activity Relationship*
  • Reproducibility of Results
  • Transition Temperature*

Substances

  • Organic Chemicals