Boosted feature selectors: a case study on prediction P-gp inhibitors and substrates

J Comput Aided Mol Des. 2018 Nov;32(11):1273-1294. doi: 10.1007/s10822-018-0171-5. Epub 2018 Oct 26.

Abstract

Feature selection is commonly used as a preprocessing step to machine learning for improving learning performance, lowering computational complexity and facilitating model interpretation. This paper proposes the application of boosting feature selection to improve the classification performance of standard feature selection algorithms evaluated for the prediction of P-gp inhibitors and substrates. Two well-known classification algorithms, decision trees and support vector machines, were used to classify the chemical compounds. The experimental results showed better performance for boosting feature selection with respect to the standard feature selection algorithms while maintaining the capability for feature reduction.

Keywords: Feature selection; Molecular activity prediction; P-gp inhibitors and substrates; QSAR.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • ATP Binding Cassette Transporter, Subfamily B / antagonists & inhibitors*
  • ATP Binding Cassette Transporter, Subfamily B / chemistry*
  • Algorithms
  • Decision Trees
  • Ligands*
  • Machine Learning*
  • Molecular Structure
  • Protein Binding
  • Quantitative Structure-Activity Relationship
  • Support Vector Machine

Substances

  • ATP Binding Cassette Transporter, Subfamily B
  • Ligands