Stable unstable reliability theory

Hoben Thomas; Arnold Lohaus; Holger Domsch

doi:10.1111/j.2044-8317.2010.02011.x

Br J Math Stat Psychol. 2012 May;65(2):201-21. doi: 10.1111/j.2044-8317.2010.02011.x. Epub 2011 Feb 2.

Authors

Hoben Thomas¹, Arnold Lohaus, Holger Domsch

Affiliation

¹ Penn State University, Pennsylvania, USA. hxt@psu.edu

PMID: 22500569
DOI: 10.1111/j.2044-8317.2010.02011.x

Abstract

Classical reliability theory assumes that each individual has the same true score on both testing occasions, a condition described as stable. If some individuals' true scores differ between testing occasions, a condition described as unstable, the estimated reliability can be misleading. A model called stable unstable reliability theory (SURT) frames stability or instability as an empirically testable question. SURT assumes a mixed population of stable and unstable individuals in unknown proportions, with w(i) denoting the probability that individual i is stable. The probability w(i) serves as individual i's test score weight, and the weights are used to form a weighted correlation coefficient r(w), which is the reliability under SURT. If all w(i) = 1, then r(w) is the classical reliability coefficient; thus classical theory is a special case of SURT. Typically r(w) is larger than the conventional reliability r, and confidence intervals on true scores are typically shorter than conventional intervals. r(w) is computed with routines in a publicly available R package.
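
The weighting idea in the abstract can be illustrated with a minimal sketch. It assumes that r(w) takes the form of the standard weighted Pearson correlation between the two testing occasions; the abstract does not give the exact formula, and in SURT the weights w(i) are estimated from the data, whereas here they are simply supplied. The function name, the scores, and the weights below are all hypothetical, and the sketch is written in Python rather than with the R package mentioned above.

```python
import math


def weighted_correlation(x, y, w):
    """Weighted Pearson correlation of paired scores x and y with weights w.

    Assumption: this is the textbook weighted correlation, used here only
    to illustrate the idea of down-weighting individuals who are probably
    unstable. It is not taken from the SURT R package.
    """
    if not (len(x) == len(y) == len(w)):
        raise ValueError("x, y and w must have the same length")
    total = sum(w)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    # Weighted means of the two testing occasions.
    mx = sum(wi * xi for wi, xi in zip(w, x)) / total
    my = sum(wi * yi for wi, yi in zip(w, y)) / total
    # Weighted sums of cross-products and squares about those means.
    sxy = sum(wi * (xi - mx) * (yi - my) for wi, xi, yi in zip(w, x, y))
    sxx = sum(wi * (xi - mx) ** 2 for wi, xi in zip(w, x))
    syy = sum(wi * (yi - my) ** 2 for wi, yi in zip(w, y))
    return sxy / math.sqrt(sxx * syy)


# Hypothetical scores for six individuals on two testing occasions.
# The last individual's score drops sharply on the second occasion.
test1 = [10, 12, 14, 16, 18, 20]
test2 = [11, 12, 15, 15, 19, 12]

# Hypothetical stability probabilities: the last individual is probably unstable.
w_stable = [1.0, 1.0, 1.0, 1.0, 1.0, 0.1]

# With all weights equal to 1 the result is the ordinary correlation r.
r = weighted_correlation(test1, test2, [1.0] * len(test1))
r_w = weighted_correlation(test1, test2, w_stable)

print(f"conventional r = {r:.3f}")
print(f"weighted r(w)  = {r_w:.3f}")
```

Setting every weight to 1 recovers the ordinary correlation, which mirrors the statement that classical theory is a special case of SURT. In this made-up data set, down-weighting the one individual whose score changed sharply raises the coefficient, which is consistent with the abstract's remark that r(w) is typically larger than r.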

Publication types

Research Support, Non-U.S. Gov't
Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

Computer Simulation / statistics & numerical data
Flicker Fusion
Habituation, Psychophysiologic
Humans
Infant
Models, Statistical*
Reading
Reproducibility of Results*
Software / statistics & numerical data