A Practical Guide to Variable Selection in Structural Equation Models with Regularized MIMIC Models

Adv Methods Pract Psychol Sci. 2019 Mar 1;2(1):55-76. doi: 10.1177/2515245919826527. Epub 2019 Mar 25.

Abstract

Methodological innovations have allowed researchers to consider increasingly sophisticated statistical models that are better aligned with the complexities of real-world behavioral data. However, despite these powerful new analytic approaches, sample sizes may not always be large enough to accommodate the increase in model complexity. This poses a difficult modeling scenario: large models with a comparably limited number of observations given the number of parameters. Here we describe a particular strategy for overcoming this challenge, called regularization. Regularization, a method that penalizes model complexity during estimation, has proven a viable option for estimating parameters in this small n, large p setting, but it has so far mostly been used in linear regression models. Here we show how to integrate regularization within structural equation models (SEMs), a popular analytic approach in psychology. We first describe the rationale behind regularization in regression contexts and how it can be extended to regularized structural equation modeling (Jacobucci, Grimm, & McArdle, 2016). We evaluate this approach in a simulation study, which shows that regularized SEM outperforms traditional SEM estimation methods in situations with a large number of predictors and a small sample size. We then demonstrate the power of this approach in two empirical examples: modeling the neural determinants of visual short-term memory and identifying demographic correlates of stress, anxiety, and depression. Throughout, we illustrate the performance of the method, discuss practical aspects of modeling empirical data, and provide a step-by-step online tutorial.
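As a minimal sketch of the regularization idea described above (the notation here is illustrative and not quoted from the abstract), the lasso variant of regularized SEM augments the usual maximum likelihood discrepancy with an L1 penalty on a chosen subset of parameters, such as the covariate-to-factor paths of a MIMIC model:

% Sketch of a lasso-penalized SEM fit function; symbols are assumptions for illustration.
F_{\mathrm{regSEM}}(\theta) \;=\; F_{\mathrm{ML}}(\theta) \;+\; \lambda \, \lVert \theta_{\mathrm{pen}} \rVert_{1}
\;=\; F_{\mathrm{ML}}(\theta) \;+\; \lambda \sum_{j} \lvert \theta_{\mathrm{pen},\,j} \rvert

Here F_ML is the ordinary maximum likelihood fit function, θ_pen collects the penalized parameters (e.g., the regression paths from predictors to the latent variable), and λ ≥ 0 governs the strength of the penalty: with λ = 0 the estimator reduces to standard ML, while larger values of λ shrink small paths toward exactly zero, which is what produces variable selection.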

Keywords: LASSO; MIMIC; regularization; structural equation models; variable selection.