Rater variables associated with ITER ratings

Michael Paget; Caren Wu; Joann McIlwrick; Wayne Woloschuk; Bruce Wright; Kevin McLaughlin

doi:10.1007/s10459-012-9391-y

Adv Health Sci Educ Theory Pract. 2013 Oct;18(4):551-7. doi: 10.1007/s10459-012-9391-y. Epub 2012 Jul 10.

Authors

Michael Paget¹, Caren Wu, Joann McIlwrick, Wayne Woloschuk, Bruce Wright, Kevin McLaughlin

Affiliation

¹ Office of Undergraduate Medical Education, Health Sciences Centre, University of Calgary, 3330 Hospital Drive NW, Calgary, AB, T2N 4N1, Canada.

PMID: 22777161

Abstract

Advocates of holistic assessment consider the in-training evaluation report (ITER) a more authentic way to assess performance. But this assessment format is subjective and, therefore, susceptible to rater bias. Here our objective was to study the association between rater variables and ITER ratings. In this observational study our participants were clerks at the University of Calgary and preceptors who completed online ITERs between February 2008 and July 2009. Our outcome variable was the global rating on the ITER (rated 1-5), and we used a generalized estimating equation model to identify variables associated with this rating. Students were rated "above expected level" or "outstanding" on 66.4% of 1050 online ITERs completed during the study period. Two rater variables attenuated ITER ratings: the log-transformed time taken to complete the ITER [β = -0.06, 95% confidence interval (-0.10, -0.02), p = 0.002], and the number of ITERs that a preceptor completed over the time period of the study [β = -0.008, 95% confidence interval (-0.02, -0.001), p = 0.02]. In this study we found evidence of leniency bias that resulted in two thirds of students being rated above expected level of performance. This leniency bias appeared to be attenuated by delay in ITER completion, and was also blunted in preceptors who rated more students. As all biases threaten the internal validity of the assessment process, further research is needed to confirm these and other sources of rater bias in ITER ratings, and to explore ways of limiting their impact.
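
To give a feel for the size of the two reported coefficients, the following is a minimal sketch of how they might translate into shifts on the 1-5 rating scale. It rests on assumptions the abstract does not confirm: that the model used an identity link (so each β is additive on the rating scale) and that the time variable was transformed with the natural logarithm. The function names are hypothetical and are not taken from the study.

```python
import math

# Coefficients as reported in the abstract.
BETA_LOG_TIME = -0.06     # per unit of log-transformed completion time
BETA_ITER_COUNT = -0.008  # per additional ITER completed by a preceptor


def shift_from_time_ratio(ratio: float, beta: float = BETA_LOG_TIME) -> float:
    """Expected rating change when completion time is multiplied by `ratio`.

    ASSUMPTION: natural-log transform and identity link; the abstract
    states neither the log base nor the link function.
    """
    if ratio <= 0:
        raise ValueError("ratio must be positive")
    return beta * math.log(ratio)


def shift_from_iter_count(extra_iters: int, beta: float = BETA_ITER_COUNT) -> float:
    """Expected rating change for `extra_iters` additional ITERs completed.

    ASSUMPTION: identity link, so the coefficient scales linearly.
    """
    return beta * extra_iters


if __name__ == "__main__":
    # Doubling the time taken to complete the ITER.
    print(round(shift_from_time_ratio(2), 3))
    # A preceptor who completed ten more ITERs during the study period.
    print(round(shift_from_iter_count(10), 3))
```

Under these assumptions, doubling the completion time corresponds to a shift of roughly -0.04 points, and ten additional ITERs to about -0.08 points, both small relative to a five-point scale.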

MeSH terms

Alberta
Clinical Clerkship* / organization & administration
Clinical Competence / standards*
Competency-Based Education
Cross-Sectional Studies
Education, Medical, Undergraduate
Educational Measurement / methods
Educational Measurement / standards
Humans
Reproducibility of Results
Students, Medical