Model selection based on combined penalties for biomarker identification

J Biopharm Stat. 2018;28(4):735-749. doi: 10.1080/10543406.2017.1378662. Epub 2017 Oct 26.

Abstract

The growing role of targeted medicine has led to an increased focus on the development of actionable biomarkers. However, current penalized selection methods used to identify biomarker panels for classification in high-dimensional data often yield highly complex panels that need careful pruning for practical use. Within the framework of regularization methods, a penalty that is a weighted sum of the L1 and L0 norms has been proposed to account for the complexity of the resulting model. In practice, this penalty is limited because the objective function is non-convex and non-smooth, the optimization is computationally intensive, and application to high-dimensional settings is challenging. In this paper, we propose a stepwise forward variable selection method that combines the L0 norm with the L1 or L2 norm. The penalized likelihood criterion used in the stepwise selection procedure yields more parsimonious models, keeping only the most relevant features. Simulation results and a real application show that our approach achieves prediction performance comparable to that of common selection methods while keeping the number of variables in the selected model to a minimum.
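To make the idea of a stepwise forward search under a combined penalty concrete, the following is a minimal sketch and not the authors' implementation. It rests on several assumptions that the abstract does not state: a least-squares loss stands in for the negative log-likelihood (the paper targets classification), the L0 term is combined with an L2 (ridge) term only, the response is roughly centered so no intercept is fitted, and the names `solve`, `criterion`, `forward_select`, `lam0`, and `lam2` are hypothetical. At each step, the candidate feature that most lowers the penalized criterion is added, and the search stops when no candidate improves it.

```python
import random

def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting.
    A is positive definite here whenever lam2 > 0, so pivots are nonzero."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(M[r][c]))
        M[c], M[p] = M[p], M[c]
        for r in range(c + 1, n):
            f = M[r][c] / M[c][c]
            for k in range(c, n + 1):
                M[r][k] -= f * M[c][k]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(M[r][k] * x[k] for k in range(r + 1, n))
        x[r] = (M[r][n] - s) / M[r][r]
    return x

def criterion(X, y, subset, lam0, lam2):
    """Ridge-fit the features in `subset` and return (score, beta), where
    score = RSS + lam2 * ||beta||_2^2 + lam0 * |subset|.
    Assumption: RSS is a stand-in for the negative log-likelihood."""
    if not subset:
        return sum(v * v for v in y), []
    cols = [[row[j] for row in X] for j in subset]
    k = len(subset)
    # Normal equations with the ridge term on the diagonal.
    A = [[sum(a * b for a, b in zip(cols[i], cols[j])) + (lam2 if i == j else 0.0)
          for j in range(k)] for i in range(k)]
    rhs = [sum(a * v for a, v in zip(cols[i], y)) for i in range(k)]
    beta = solve(A, rhs)
    rss = sum((v - sum(bj * cols[j][i] for j, bj in enumerate(beta))) ** 2
              for i, v in enumerate(y))
    return rss + lam2 * sum(bj * bj for bj in beta) + lam0 * k, beta

def forward_select(X, y, lam0, lam2):
    """Greedy forward search: add the best feature while the criterion drops."""
    selected = []
    best, _ = criterion(X, y, selected, lam0, lam2)
    p = len(X[0])
    while len(selected) < p:
        candidates = [(criterion(X, y, selected + [j], lam0, lam2)[0], j)
                      for j in range(p) if j not in selected]
        score, j = min(candidates)
        if score >= best:  # no candidate improves the penalized criterion
            break
        selected.append(j)
        best = score
    return selected, best

# Synthetic illustration (made-up data): only features 0 and 3 carry signal.
random.seed(0)
n, p = 80, 6
X = [[random.gauss(0, 1) for _ in range(p)] for _ in range(n)]
y = [2.0 * r[0] - 3.0 * r[3] + random.gauss(0, 0.5) for r in X]
selected, score = forward_select(X, y, lam0=10.0, lam2=1.0)
print("selected features:", selected)
```

In this sketch the L0 weight `lam0` acts as a fixed price per added variable, so a feature enters the model only if it lowers the ridge-penalized loss by more than that price. Raising `lam0` therefore gives smaller panels, which is the parsimony effect the abstract describes.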

Keywords: Biomarker panels; combined penalties; model selection; penalized regression; regularization; sparsity; stepwise variable selection; treatment responder.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Biomarkers
  • Computer Simulation / statistics & numerical data*
  • Databases, Factual*
  • Humans
  • Models, Biological*

Substances

  • Biomarkers