A prediction model of substrates and non-substrates of breast cancer resistance protein (BCRP) developed by GA-CG-SVM method

Comput Biol Med. 2011 Nov;41(11):1006-13. doi: 10.1016/j.compbiomed.2011.08.009. Epub 2011 Sep 14.

Abstract

Breast cancer resistance protein (BCRP) is one of the key multi-drug resistance proteins, which significantly influences the therapeutic effects of many drugs, particularly anti-cancer drugs. Thus, distinguishing between substrates and non-substrates of BCRP is important not only for clinical use but also for drug discovery and development. In this study, a prediction model of the substrates and non-substrates of BCRP was developed using a modified support vector machine (SVM) method, namely GA-CG-SVM. The overall prediction accuracy of the established GA-CG-SVM model is 91.3% for the training set and 85.0% for an independent validation set. For comparison, two other machine learning methods, namely, C4.5 DT and k-NN, were also adopted to build prediction models. The results show that the GA-CG-SVM model is significantly superior to C4.5 DT and k-NN models in terms of the prediction accuracy. To sum up, the prediction model of BCRP substrates and non-substrates generated by the GA-CG-SVM method is sufficiently good and could be used as a screening tool for identifying the substrates and non-substrates of BCRP.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • ATP Binding Cassette Transporter, Subfamily G, Member 2
  • ATP-Binding Cassette Transporters / antagonists & inhibitors*
  • ATP-Binding Cassette Transporters / chemistry*
  • ATP-Binding Cassette Transporters / metabolism
  • Animals
  • Antineoplastic Agents / chemistry*
  • Antineoplastic Agents / therapeutic use
  • Breast Neoplasms / drug therapy*
  • Breast Neoplasms / metabolism
  • Drug Resistance, Neoplasm*
  • Female
  • Humans
  • Models, Biological*
  • Neoplasm Proteins / antagonists & inhibitors*
  • Neoplasm Proteins / chemistry*
  • Neoplasm Proteins / metabolism
  • Predictive Value of Tests
  • Support Vector Machine*

Substances

  • ABCG2 protein, human
  • ATP Binding Cassette Transporter, Subfamily G, Member 2
  • ATP-Binding Cassette Transporters
  • Antineoplastic Agents
  • Neoplasm Proteins