QSARINS-chem: Insubria datasets and new QSAR/QSPR models for environmental pollutants in QSARINS

J Comput Chem. 2014 May 15;35(13):1036-44. doi: 10.1002/jcc.23576. Epub 2014 Mar 5.

Abstract

A database of environmentally hazardous chemicals, collected and modeled by QSAR by the Insubria group, is included in the updated version of QSARINS, a software package recently proposed for developing and validating QSAR models by the genetic algorithm-ordinary least squares method. In this version, a module named QSARINS-Chem includes several datasets of chemical structures and their corresponding endpoints (physicochemical properties and biological activities). The chemicals can be accessed by different identifiers (CAS number, SMILES, name, and so forth), and their three-dimensional structures can be visualized. Some of the QSAR models previously published by our group have been redeveloped using PaDEL-Descriptor, free online software for molecular descriptor calculation. The new models can be easily applied to predict values for chemicals without experimental data, while also checking whether the new chemicals fall within the applicability domain of the model. The QSAR model reporting format (QMRF) documents for these models can also be downloaded. Additional chemometric analyses, by principal component analysis and multicriteria decision making, can be used to screen and rank chemicals in order to prioritize the most dangerous.

Keywords: PaDEL-Descriptor models; QMRF; datasets; environmental chemicals; ranking.
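
The abstract mentions applying ordinary least squares models to new chemicals while verifying the applicability domain. As a minimal sketch of what such a check can look like, the Python example below fits an OLS model and flags query chemicals by leverage. The abstract does not say how QSARINS computes the applicability domain, so the leverage approach, the cutoff h* = 3(p + 1)/n, the function names, and the synthetic descriptor values are all assumptions made for illustration.

```python
import numpy as np


def fit_ols(X, y):
    """Fit y = b0 + X @ b by ordinary least squares (intercept first)."""
    A = np.column_stack([np.ones(len(X)), X])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    return coef


def leverages(X_train, X_query):
    """Leverage h_i = x_i^T (X^T X)^-1 x_i of each query row."""
    A_train = np.column_stack([np.ones(len(X_train)), X_train])
    A_query = np.column_stack([np.ones(len(X_query)), X_query])
    xtx_inv = np.linalg.pinv(A_train.T @ A_train)
    return np.einsum("ij,jk,ik->i", A_query, xtx_inv, A_query)


def leverage_threshold(X_train):
    """Assumed warning cutoff h* = 3(p + 1)/n for p descriptors, n chemicals."""
    n, p = X_train.shape
    return 3.0 * (p + 1) / n


# Synthetic stand-in for a descriptor matrix (not data from the paper).
rng = np.random.default_rng(0)
X_train = rng.normal(size=(30, 2))
y_train = 1.0 + 2.0 * X_train[:, 0] - 0.5 * X_train[:, 1]

coef = fit_ols(X_train, y_train)

# Two hypothetical query chemicals: one near the training data, one far away.
X_new = np.array([[0.0, 0.0], [10.0, -10.0]])
pred = np.column_stack([np.ones(len(X_new)), X_new]) @ coef

h = leverages(X_train, X_new)
h_star = leverage_threshold(X_train)
inside = h <= h_star  # True where the prediction is an interpolation

for p_i, h_i, ok in zip(pred, h, inside):
    print(f"prediction={p_i:.2f} leverage={h_i:.3f} inside_domain={ok}")
```

A prediction for a chemical whose leverage exceeds the cutoff is an extrapolation beyond the training descriptors and should be treated with less confidence.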