Boosting support vector machines for cancer discrimination tasks

Comput Biol Med. 2018 Oct 1:101:236-249. doi: 10.1016/j.compbiomed.2018.08.006. Epub 2018 Aug 10.

Abstract

Cancer is a complex disease that is caused by rapid alteration of genes. Prediction of the state of cancer in advance contributes to a better understanding of its mechanism and improves the cancer therapy process. For example, predicting the malignancy of tumors in advance can prevent the development of cancer through the early treatment and clinical management of tumor progression. Despite generation of extensive clinical data obtained from the high-throughput technologies, it is necessary to develop machine learning algorithms to guide the prediction process. In the study, we utilize boosting and develop three computational methods to increase the performance of support vector machines (SVM). The aforementioned methods improve the performance over existing state-of-the-art algorithms, including SVM and xgboost. We evaluate the proposed boosting approach relative to the existing algorithms by using several gene expression data related to oral cancer, breast cancer, pheochromocytomas and paragangliomas, bladder cancer, and gastric cancer. The reported results using several performance measures indicate that algorithms employing the proposed approach outperform algorithms employing the baseline approach.

Keywords: Cancer classification; Cancer genomics; Cancer identification; Machine learning; Personalized treatment.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Gene Expression Regulation, Neoplastic*
  • Humans
  • Models, Biological*
  • Neoplasms* / metabolism
  • Neoplasms* / mortality
  • Neoplasms* / therapy
  • Predictive Value of Tests
  • Support Vector Machine*