Doubly regularized estimation and selection in linear mixed-effects models for high-dimensional longitudinal data

Stat Interface. 2018;11(4):721-737. doi: 10.4310/SII.2018.v11.n4.a15. Epub 2018 Sep 19.

Abstract

The linear mixed-effects model (LMM) is widely used in the analysis of clustered or longitudinal data. This paper aims to address analytic challenges arising from estimation and selection in the application of the LMM to high-dimensional longitudinal data. We develop a doubly regularized approach in the LMM to simultaneously select fixed and random effects. On the theoretical front, we establish large sample properties for the proposed method under the high-dimensional setting, allowing both numbers of fixed effects and random effects to be much larger than the sample size. We present new regularity conditions for the diverging rates, under which the proposed method achieves both estimation and selection consistency. In addition, we propose a new algorithm that solves the related optimization problem effectively so that its computational cost is comparable with that of the Newton-Raphson algorithm for maximum likelihood estimator in the LMM. Through simulation studies we assess performances of the proposed regularized LMM in both aspects of variable selection and estimation. We also illustrate the proposed method by two data analysis examples.

Keywords: Diverging rate; Primary 62J05, 62J07; Random-effects selection; Regularization; Variable selection; secondary 62F12.