Measurement error in earnings data: Using a mixture model approach to combine survey and register data

Erik Meijer; Susann Rohwedder; Tom Wansbeek

doi:10.1198/jbes.2011.08166

Measurement error in earnings data: Using a mixture model approach to combine survey and register data

J Bus Econ Stat. 2012;30(2):191-201. doi: 10.1198/jbes.2011.08166. Epub 2012 May 24.

Authors

Erik Meijer¹, Susann Rohwedder, Tom Wansbeek

Affiliation

¹ RAND Corporation, Santa Monica, CA 90407-2138.

Abstract

Survey data on earnings tend to contain measurement error. Administrative data are superior in principle, but they are worthless in case of a mismatch. We develop methods for prediction in mixture factor analysis models that combine both data sources to arrive at a single earnings figure. We apply the methods to a Swedish data set. Our results show that register earnings data perform poorly if there is a (small) probability of a mismatch. Survey earnings data are more reliable, despite their measurement error. Predictors that combine both and take conditional class probabilities into account outperform all other predictors.

Keywords: Factor score; administrative data; finite mixture; structural equation model; validation study.

Abstract

Grants and funding