The Effects of Sample Size on the Estimation of Regression Mixture Models

Educ Psychol Meas. 2019 Apr;79(2):358-384. doi: 10.1177/0013164418791673. Epub 2018 Aug 10.

Abstract

Regression mixture models are a statistical approach used for estimating heterogeneity in effects. This study investigates the impact of sample size on the ability of regression mixture models to produce "stable" results. Monte Carlo simulations and analysis of resamples from an application data set were used to illustrate the types of problems that may occur with small samples in real data sets. The results suggest that (a) when class separation is low, very large sample sizes may be needed to obtain stable results; (b) it may often be necessary to consider a preponderance of evidence in latent class enumeration; (c) regression mixtures with ordinal outcomes result in even more instability; and (d) with small samples, it is possible to obtain spurious results without any clear indication of there being a problem.

Keywords: heterogeneous effects; regression mixture models; sample size.
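
To make the idea of "stability" across sample sizes concrete, the following is a minimal sketch of a Monte Carlo check for a two-class regression mixture with a continuous outcome. It is not the authors' simulation design: the class slopes (0.2 and 0.7), the residual standard deviation, the equal class proportions, the number of replications, and the helper names `simulate`, `fit_em`, and `slope_gap_spread` are all assumptions chosen for illustration, and the model is fitted with a plain NumPy EM routine rather than any particular mixture-modeling software.

```python
import numpy as np


def simulate(n, rng, slopes=(0.2, 0.7), intercepts=(0.0, 0.0), sigma=1.0, pi1=0.5):
    """Draw n cases from a two-class regression mixture (illustrative values)."""
    x = rng.normal(size=n)
    z = (rng.random(n) >= pi1).astype(int)  # latent class: 0 with probability pi1
    y = np.asarray(intercepts)[z] + np.asarray(slopes)[z] * x
    y = y + rng.normal(scale=sigma, size=n)
    return x, y, z


def fit_em(x, y, k=2, n_iter=200, rng=None):
    """Fit a k-class Gaussian regression mixture by EM from a random soft start."""
    if rng is None:
        rng = np.random.default_rng(0)
    n = len(y)
    X = np.column_stack([np.ones(n), x])
    resp = rng.dirichlet(np.ones(k), size=n)  # random initial responsibilities
    loglik = []
    for _ in range(n_iter):
        # M-step: class proportions, weighted least squares, residual variances
        pi = resp.mean(axis=0)
        beta = np.empty((k, 2))
        sigma2 = np.empty(k)
        for j in range(k):
            w = resp[:, j]
            sw = np.sqrt(w)
            beta[j] = np.linalg.lstsq(X * sw[:, None], y * sw, rcond=None)[0]
            resid = y - X @ beta[j]
            sigma2[j] = max((w * resid**2).sum() / w.sum(), 1e-8)  # variance floor
        # E-step: posterior class probabilities, computed on the log scale
        mu = X @ beta.T
        logp = (np.log(pi) - 0.5 * np.log(2 * np.pi * sigma2)
                - 0.5 * (y[:, None] - mu) ** 2 / sigma2)
        m = logp.max(axis=1, keepdims=True)
        lse = m[:, 0] + np.log(np.exp(logp - m).sum(axis=1))
        loglik.append(lse.sum())
        resp = np.exp(logp - lse[:, None])
        # keep every weight strictly positive so no class can vanish numerically
        resp = np.clip(resp, 1e-12, None)
        resp /= resp.sum(axis=1, keepdims=True)
    return pi, beta, sigma2, np.array(loglik)


def slope_gap_spread(n, reps=20, seed=0):
    """SD across replications of the estimated difference between class slopes."""
    rng = np.random.default_rng(seed)
    gaps = []
    for _ in range(reps):
        x, y, _z = simulate(n, rng)
        _pi, beta, _s2, _ll = fit_em(x, y, rng=rng)
        slopes = np.sort(beta[:, 1])  # sort to sidestep label switching
        gaps.append(slopes[1] - slopes[0])
    return float(np.std(gaps))


if __name__ == "__main__":
    for n in (200, 2000):
        print(n, round(slope_gap_spread(n), 3))
```

A larger spread in the estimated slope difference at a given sample size is one rough indicator of instability. A fuller study would also use multiple random starts per data set and compare class enumeration criteria, which this sketch omits.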