Reading Comprehension Tests for Children: Test Equating and Specific Age-Interval Reports

Patrícia Silva Lúcio; Fausto Coutinho Lourenço; Hugo Cogo-Moreira; Deborah Bandalos; Carolina Alves Ferreira de Carvalho; Adriana de Souza Batista Kida; Clara Regina Brandão de Ávila

doi:10.3389/fpsyg.2021.662192

Reading Comprehension Tests for Children: Test Equating and Specific Age-Interval Reports

Front Psychol. 2021 Sep 10:12:662192. doi: 10.3389/fpsyg.2021.662192. eCollection 2021.

Authors

Patrícia Silva Lúcio¹, Fausto Coutinho Lourenço², Hugo Cogo-Moreira³, Deborah Bandalos⁴, Carolina Alves Ferreira de Carvalho⁵, Adriana de Souza Batista Kida⁵, Clara Regina Brandão de Ávila⁵

Affiliations

¹ Department of Psychology and Psychoanalysis, State University of Londrina, Londrina, Brazil.
² Department of Psychobiology, Federal University of São Paulo, São Paulo, Brazil.
³ Faculty of Teacher Education and Languages, Østfold University College, Halden, Norway.
⁴ Assessment and Measurement PhD Program, James Madison University, Harrisonburg, VA, United States.
⁵ Department of Speech-Language Pathology and Audiology, Federal University of São Paulo, São Paulo, Brazil.

Abstract

Equating is used to directly compare alternate forms of tests. We describe the equating of two alternative forms of a reading comprehension test for Brazilian children (2nd to 5th grade), Form A (n = 427) and Form B (n = 321). We employed non-equivalent random groups design with internal anchor items. Local independence was attested via standardized residual Pearson's bivariate correlation. First, from 176 items, we selected 42 in each form (33 unique and 9 in common) using 2PL model, a one-dimensional item response theory (IRT) model. Using the equateIRT package for R, the anchor items were used to link both forms. Linking coefficients were estimated under two different methods (Haebara and Stocking-Lord), resulting in scores equating by two methods: observed score equating (OSE) and true score equating (TSE). We provided reference-specific age-intervals for the sample. The final version was informative for a wide range of theta abilities. We concluded that the forms could be used interchangeably.

Keywords: anchor items; concurrent calibration; equating; item response theory; reading comprehension.