Model selection based on combined penalties for biomarker identification

Eleni Vradi; Werner Brannath; Thomas Jaki; Richardus Vonk

doi:10.1080/10543406.2017.1378662

Model selection based on combined penalties for biomarker identification

J Biopharm Stat. 2018;28(4):735-749. doi: 10.1080/10543406.2017.1378662. Epub 2017 Oct 26.

Authors

Eleni Vradi¹, Werner Brannath², Thomas Jaki³, Richardus Vonk¹

Affiliations

¹ a Department of Research and Clinical Sciences Statistics , Bayer AG , Berlin , Germany.
² b Institute of Statistics, Competence Center for Clinical Trials Bremen , Faculty 3, University of Bremen , Bremen , Germany.
³ c Department of Mathematics and Statistics , Medical and Pharmaceutical Statistics Research Unit, Lancaster University , Lancaster , United Kingdom.

PMID: 29072549
DOI: 10.1080/10543406.2017.1378662

Abstract

The growing role of targeted medicine has led to an increased focus on the development of actionable biomarkers. Current penalized selection methods that are used to identify biomarker panels for classification in high-dimensional data, however, often result in highly complex panels that need careful pruning for practical use. In the framework of regularization methods, a penalty that is a weighted sum of the L₁ and L₀ norm has been proposed to account for the complexity of the resulting model. In practice, the limitation of this penalty is that the objective function is non-convex, non-smooth, the optimization is computationally intensive and the application to high-dimensional settings is challenging. In this paper, we propose a stepwise forward variable selection method which combines the L₀ with L₁ or L₂ norms. The penalized likelihood criterion that is used in the stepwise selection procedure results in more parsimonious models, keeping only the most relevant features. Simulation results and a real application show that our approach exhibits a comparable performance with common selection methods with respect to the prediction performance while minimizing the number of variables in the selected model resulting in a more parsimonious model as desired.

Keywords: Biomarker panels; combined penalties; model selection; penalized regression; regularization; sparsity; stepwise variable selection; treatment responder.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Biomarkers
Computer Simulation / statistics & numerical data*
Databases, Factual*
Humans
Models, Biological*

Substances

Biomarkers

Grants and funding

SRF-2015-08-001/DH_/Department of Health/United Kingdom