Identification of early liver toxicity gene biomarkers using comparative supervised machine learning

Sci Rep. 2020 Nov 5;10(1):19128. doi: 10.1038/s41598-020-76129-8.

Abstract

Screening agrochemicals and pharmaceuticals for potential liver toxicity is required for regulatory approval and is an expensive and time-consuming process. The identification and utilization of early exposure gene signatures and robust predictive models in regulatory toxicity testing has the potential to reduce time and costs substantially. In this study, comparative supervised machine learning approaches were applied to the rat liver TG-GATEs dataset to develop feature selection and predictive testing. We identified ten gene biomarkers using three different feature selection methods that predicted liver necrosis with high specificity and selectivity in an independent validation dataset from the Microarray Quality Control (MAQC)-II study. Nine of the ten genes that were selected with the supervised methods are involved in metabolism and detoxification (Car3, Crat, Cyp39a1, Dcd, Lbp, Scly, Slc23a1, and Tkfc) and transcriptional regulation (Ablim3). Several of these genes are also implicated in liver carcinogenesis, including Crat, Car3 and Slc23a1. Our biomarker gene signature provides high statistical accuracy and a manageable number of genes to study as indicators to potentially accelerate toxicity testing based on their ability to induce liver necrosis and, eventually, liver cancer.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Agrochemicals / toxicity*
  • Algorithms
  • Animals
  • Chemical and Drug Induced Liver Injury / diagnosis*
  • Chemical and Drug Induced Liver Injury / genetics
  • Gene Expression Regulation / drug effects
  • Genetic Markers*
  • Liver / drug effects*
  • Male
  • Oligonucleotide Array Sequence Analysis
  • Rats
  • Supervised Machine Learning*

Substances

  • Agrochemicals
  • Genetic Markers