Collaborative filtering on a family of biological targets

J Chem Inf Model. 2006 Mar-Apr;46(2):626-35. doi: 10.1021/ci050367t.

Abstract

Building a QSAR model of a new biological target for which few screening data are available is a statistical challenge. However, the new target may be part of a bigger family, for which we have more screening data. Collaborative filtering or, more generally, multi-task learning, is a machine learning approach that improves the generalization performance of an algorithm by using information from related tasks as an inductive bias. We use collaborative filtering techniques for building predictive models that link multiple targets to multiple examples. The more commonalities between the targets, the better the multi-target model that can be built. We show an example of a multi-target neural network that can use family information to produce a predictive model of an undersampled target. We evaluate JRank, a kernel-based method designed for collaborative filtering. We show their performance on compound prioritization for an HTS campaign and the underlying shared representation between targets. JRank outperformed the neural network both in the single- and multi-target models.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Ligands*
  • Neural Networks, Computer*
  • Pharmaceutical Preparations / chemistry*
  • Pharmaceutical Preparations / classification
  • Quantitative Structure-Activity Relationship*

Substances

  • Ligands
  • Pharmaceutical Preparations