Sparse group variable selection for gene-environment interactions in the longitudinal study

Genet Epidemiol. 2022 Jul;46(5-6):317-340. doi: 10.1002/gepi.22461. Epub 2022 Jun 29.

Abstract

Penalized variable selection for high-dimensional longitudinal data has received much attention as it can account for the correlation among repeated measurements while providing additional and essential information for improved identification and prediction performance. Despite the success, in longitudinal studies, the potential of penalization methods is far from fully understood for accommodating structured sparsity. In this article, we develop a sparse group penalization method to conduct the bi-level gene-environment (G × $\times $ E) interaction study under the repeatedly measured phenotype. Within the quadratic inference function framework, the proposed method can achieve simultaneous identification of main and interaction effects on both the group and individual levels. Simulation studies have shown that the proposed method outperforms major competitors. In the case study of asthma data from the Childhood Asthma Management Program, we conduct G × $\times $ E study by using high-dimensional single nucleotide polymorphism data as genetic factors and the longitudinal trait, forced expiratory volume in 1 s, as the phenotype. Our method leads to improved prediction and identification of main and interaction effects with important implications.

Keywords: gene-environment interaction; longitudinal data; penalization; quadratic inference function; sparse group selection.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Asthma* / genetics
  • Computer Simulation
  • Gene-Environment Interaction*
  • Humans
  • Longitudinal Studies
  • Models, Genetic