Converging on quality: Examining multiple measures of teaching effectiveness

Lia E Sandilos; Wesley A Sims; Kate E Norwalk; Linda A Reddy

doi:10.1016/j.jsp.2019.05.004

Converging on quality: Examining multiple measures of teaching effectiveness

J Sch Psychol. 2019 Jun:74:10-28. doi: 10.1016/j.jsp.2019.05.004. Epub 2019 May 20.

Authors

Lia E Sandilos¹, Wesley A Sims², Kate E Norwalk³, Linda A Reddy⁴

Affiliations

¹ Temple University, United States of America. Electronic address: lia.sandilos@temple.edu.
² University of California Riverside, United States of America.
³ North Carolina State University, United States of America.
⁴ Rutgers University, United States of America.

PMID: 31213228
DOI: 10.1016/j.jsp.2019.05.004

Abstract

The present study explores the convergent and predictive validity for several widely used measures of teaching quality from the Measures of Effective Teaching Project (Bill and Melinda Gates Foundation, 2009-2011). Specifically, the Classroom Assessment Scoring System (CLASS; Pianta, Hamre, & Mintz, 2012), the Framework for Teaching (FFT; Danielson Group, 2013), and the Tripod Student Perceptions Scale (Tripod; Ferguson, 2008) were examined. Correlations among measures were assessed by developmental level and content area (elementary mathematics N = 70; elementary English language arts N = 101; middle school mathematics N = 291, middle school English language arts N = 280). Both average scores and score variability (i.e., coefficient of variation) for the CLASS, FFT, and Tripod were used to predict value-added models (VAM), a high-stakes measure of students' academic growth. For elementary mathematics and ELA, findings indicated the CLASS and FFT exhibited moderate convergent validity while divergent validity was found between the Tripod and the CLASS and FFT. Across content areas in middle school grades, the CLASS, FFT, and Tripod exhibited moderate to high-moderate convergent validity. Average student and observer scores were positively related to VAM scores, whereas variability in scores demonstrated negative relations to VAM scores. Implications of findings for teacher evaluation and professional development are discussed.

Keywords: Classroom assessment; Student achievement; Teacher evaluation.

MeSH terms

Academic Success*
Child
Female
Humans
Language
Male
Mathematics / education
Psychometrics
Reproducibility of Results
School Teachers / standards*
Students*
Teaching / standards*