Dirichlet-Laplace priors for optimal shrinkage

J Am Stat Assoc. 2015 Dec 1;110(512):1479-1490. doi: 10.1080/01621459.2014.960967. Epub 2014 Sep 25.

Abstract

Penalized regression methods, such as L1 regularization, are routinely used in high-dimensional applications, and there is a rich literature on optimality properties under sparsity assumptions. In the Bayesian paradigm, sparsity is routinely induced through two-component mixture priors having a probability mass at zero, but such priors encounter daunting computational problems in high dimensions. This has motivated continuous shrinkage priors, which can be expressed as global-local scale mixtures of Gaussians, facilitating computation. In contrast to the frequentist literature, little is known about the properties of such priors and the convergence and concentration of the corresponding posterior distribution. In this article, we propose a new class of Dirichlet-Laplace priors, which possess optimal posterior concentration and lead to efficient posterior computation. Finite sample performance of Dirichlet-Laplace priors relative to alternatives is assessed in simulated and real data examples.

Keywords: Bayesian; Convergence rate; High dimensional; L1; Lasso; Penalized regression; Regularization; Shrinkage prior.