Consistent Estimation of Generalized Linear Models with High Dimensional Predictors via Stepwise Regression

Entropy (Basel). 2020 Aug 31;22(9):965. doi: 10.3390/e22090965.

Abstract

Predictive models play a central role in decision making. Penalized regression approaches, such as least absolute shrinkage and selection operator (LASSO), have been widely used to construct predictive models and explain the impacts of the selected predictors, but the estimates are typically biased. Moreover, when data are ultrahigh-dimensional, penalized regression is usable only after applying variable screening methods to downsize variables. We propose a stepwise procedure for fitting generalized linear models with ultrahigh dimensional predictors. Our procedure can provide a final model; control both false negatives and false positives; and yield consistent estimates, which are useful to gauge the actual effect size of risk factors. Simulations and applications to two clinical studies verify the utility of the method.

Keywords: estimation consistency; generalized linear models; high dimensional predictors; model selection; stepwise regression.