Restricted Recalibration of Item Response Theory Models

Yang Liu; Ji Seung Yang; Alberto Maydeu-Olivares

doi:10.1007/s11336-019-09667-4

Restricted Recalibration of Item Response Theory Models

Psychometrika. 2019 Jun;84(2):529-553. doi: 10.1007/s11336-019-09667-4. Epub 2019 Mar 20.

Authors

Yang Liu¹, Ji Seung Yang², Alberto Maydeu-Olivares^{3

4}

Affiliations

¹ Department of Human Development and Quantitative Methodology, University of Maryland, College Park, USA. yliu87@umd.edu.
² Department of Human Development and Quantitative Methodology, University of Maryland, College Park, USA.
³ Department of Psychology, University of South Carolina, Columbia, USA.
⁴ Department of Psychology, University of Barcelona, Barcelona, Spain.

PMID: 30895437
DOI: 10.1007/s11336-019-09667-4

Abstract

In item response theory (IRT), it is often necessary to perform restricted recalibration (RR) of the model: A set of (focal) parameters is estimated holding a set of (nuisance) parameters fixed. Typical applications of RR include expanding an existing item bank, linking multiple test forms, and associating constructs measured by separately calibrated tests. In the current work, we provide full statistical theory for RR of IRT models under the framework of pseudo-maximum likelihood estimation. We describe the standard error calculation for the focal parameters, the assessment of overall goodness-of-fit (GOF) of the model, and the identification of misfitting items. We report a simulation study to evaluate the performance of these methods in the scenario of adding a new item to an existing test. Parameter recovery for the focal parameters as well as Type I error and power of the proposed tests are examined. An empirical example is also included, in which we validate the pediatric fatigue short-form scale in the Patient-Reported Outcome Measurement Information System (PROMIS), compute global and local GOF statistics, and update parameters for the misfitting items.

Keywords: contingency table; cross-validation; goodness of fit; item calibration; item response theory; measurement invariance; pseudo-maximum likelihood; residual.

Publication types

Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

Calibration*
Humans
Likelihood Functions*
Patient Reported Outcome Measures*
Psychometrics