FEATURE SCREENING FOR TIME-VARYING COEFFICIENT MODELS WITH ULTRAHIGH DIMENSIONAL LONGITUDINAL DATA

Ann Appl Stat. 2016 Jun;10(2):596-617. doi: 10.1214/16-AOAS912. Epub 2016 Jul 22.

Abstract

Motivated by an empirical analysis of the Childhood Asthma Management Project, CAMP, we introduce a new screening procedure for varying coefficient models with ultrahigh dimensional longitudinal predictor variables. The performance of the proposed procedure is investigated via Monte Carlo simulation. Numerical comparisons indicate that it outperforms existing ones substantially, resulting in significant improvements in explained variability and prediction error. Applying these methods to CAMP, we are able to find a number of potentially important genetic mutations related to lung function, several of which exhibit interesting nonlinear patterns around puberty.

Keywords: Feature Selection; Functional Linear Model; Genome-Wide Association Study; Time-varying Coefficient Models; Ultrahigh Dimensional Longitudinal Data.