Kernel Multitask Regression for Toxicogenetics

Mol Inform. 2017 Oct;36(10). doi: 10.1002/minf.201700053. Epub 2017 Sep 26.

Abstract

The development of high-throughput in vitro assays to study quantitatively the toxicity of chemical compounds on genetically characterized human-derived cell lines paves the way to predictive toxicogenetics, where one would be able to predict the toxicity of any particular compound on any particular individual. In this paper we present a machine learning-based approach for that purpose, kernel multitask regression (KMR), which combines chemical characterizations of molecular compounds with genetic and transcriptomic characterizations of cell lines to predict the toxicity of a given compound on a given cell line. We demonstrate the relevance of the method on the recent DREAM8 Toxicogenetics challenge, where it ranked among the best state-of-the-art models, and discuss the importance of choosing good descriptors for cell lines and chemicals.

Keywords: Toxicogenetics; kernel methods; machine learning; multitask regression.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Animals
  • Humans
  • Machine Learning
  • Regression Analysis
  • Toxicity Tests
  • Toxicogenetics / methods*