Efficient robust doubly adaptive regularized regression with applications

Stat Methods Med Res. 2019 Jul;28(7):2210-2226. doi: 10.1177/0962280218757560. Epub 2018 Feb 16.

Abstract

We consider the problem of estimation and variable selection for general linear regression models. Regularized regression procedures have been widely used for variable selection, but most existing methods perform poorly in the presence of outliers. We construct a new penalized procedure that simultaneously attains full efficiency and maximum robustness. Furthermore, the proposed procedure satisfies the oracle properties. The new procedure is designed to achieve sparse and robust solutions by imposing adaptive weights on both the decision loss and the penalty function. The proposed method of estimation and variable selection attains full efficiency when the model is correct and, at the same time, achieves maximum robustness when outliers are present. We examine the robustness properties using the finite-sample breakdown point and an influence function. We show that the proposed estimator attains the maximum breakdown point. Furthermore, there is no loss in efficiency when there are no outliers or the error distribution is normal. For practical implementation of the proposed method, we present a computational algorithm. We examine the finite-sample and robustness properties using Monte Carlo studies. Two datasets are also analyzed.

Keywords: Regularized regression; efficiency; robustness; variable selection.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Attention Deficit Disorder with Hyperactivity / diagnostic imaging*
  • Attention Deficit Disorder with Hyperactivity / physiopathology
  • Child
  • Computer Simulation
  • Environmental Pollutants / analysis*
  • Female
  • Humans
  • Linear Models
  • Magnetic Resonance Imaging / methods*
  • Male
  • Models, Statistical*
  • Monte Carlo Method

Substances

  • Environmental Pollutants