Gene selection for microarray gene expression classification using Bayesian Lasso quantile regression

Comput Biol Med. 2018 Jun 1:97:145-152. doi: 10.1016/j.compbiomed.2018.04.018. Epub 2018 Apr 27.

Abstract

Gene selection has been proven to be an effective way to improve the results of many classification methods. However, existing gene selection techniques in binary classification regression are sensitive to outliers of the data, heteroskedasticity or other anomalies of the latent response. In this paper, we propose a new Bayesian hierarchical model to overcome these problems in a relatively straightforward way. In particular, we propose a new Bayesian Lasso method that employs a skewed Laplace distribution for the errors and a scaled mixture of uniform distribution for the regression parameters, together with Bayesian MCMC estimation. Comprehensive comparisons between our proposed gene selection method and other competitor methods are performed experimentally, depending on four benchmark gene expression datasets. The experimental results prove that the proposed method is very effective for selecting the most relevant genes with high classification accuracy.

Keywords: Bayesian hierarchical model; Classification; Gene selection; Lasso; Quantile regression.

MeSH terms

  • Algorithms
  • Bayes Theorem
  • Computational Biology
  • Databases, Genetic
  • Gene Expression Profiling / methods*
  • Humans
  • Models, Statistical*
  • Neoplasms / classification
  • Neoplasms / genetics
  • Neoplasms / metabolism
  • Oligonucleotide Array Sequence Analysis / methods*
  • Regression Analysis
  • Transcriptome / genetics*