Development of a QSAR model to predict hepatic steatosis using freely available machine learning tools

Food Chem Toxicol. 2020 Aug:142:111494. doi: 10.1016/j.fct.2020.111494. Epub 2020 Jun 14.

Abstract

There are various types of hepatic steatosis of which non-alcoholic fatty liver disease, which may be caused by exposure to chemicals and environmental pollutants is the most prevalent, representing a potential major health risk. QSAR modelling has the potential to provide a rapid and cost-effective method to identify compounds which may trigger steatosis. Although models exist to predict key molecular initiating events of steatosis such as nuclear receptor binding, we are aware of no models to predict the apical effect steatosis. In this study, we describe the development of a QSAR model to predict steatosis using freely available machine learning tools. It was built using a dataset of 207 pharmaceuticals and pesticides which were identified as steatotic or non-steatotic from existing data from in vivo human and animal studies. The best performing model developed using the linear discriminant analysis module in TANAGRA, based on four chemical descriptors, had an accuracy of 70%, a sensitivity of 66% and a specificity of 74%. The expansion of the steatosis dataset to other chemical types, to enable the development of further models, would be of benefit in the identification of compounds with a range of mechanisms of action contributing to steatosis.

Keywords: Non-alcoholic fatty liver disease; QSAR model; Steatosis.

MeSH terms

  • Algorithms
  • Environmental Pollutants / chemistry
  • Environmental Pollutants / toxicity
  • Humans
  • Machine Learning*
  • Non-alcoholic Fatty Liver Disease / chemically induced
  • Non-alcoholic Fatty Liver Disease / metabolism*
  • Quantitative Structure-Activity Relationship

Substances

  • Environmental Pollutants