Drawing inferences for high-dimensional linear models: A selection-assisted partial regression and smoothing approach

Biometrics. 2019 Jun;75(2):551-561. doi: 10.1111/biom.13013. Epub 2019 Mar 29.

Abstract

Drawing inferences for high-dimensional models is challenging as regular asymptotic theories are not applicable. This article proposes a new framework of simultaneous estimation and inferences for high-dimensional linear models. By smoothing over partial regression estimates based on a given variable selection scheme, we reduce the problem to low-dimensional least squares estimations. The procedure, termed as Selection-assisted Partial Regression and Smoothing (SPARES), utilizes data splitting along with variable selection and partial regression. We show that the SPARES estimator is asymptotically unbiased and normal, and derive its variance via a nonparametric delta method. The utility of the procedure is evaluated under various simulation scenarios and via comparisons with the de-biased LASSO estimators, a major competitor. We apply the method to analyze two genomic datasets and obtain biologically meaningful results.

Keywords: Selection-assisted Partial Regression and Smoothing (SPARES); confidence intervals; high-dimensional inference; hypothesis testing; multisample-splitting.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Computer Simulation
  • Genomics / statistics & numerical data
  • Humans
  • Least-Squares Analysis
  • Linear Models*
  • Regression Analysis