Asymptotic distributions of kappa statistics and their differences with many raters, many rating categories and two conditions

Luca Grassano; Guido Pagana; Marco Daperno; Enrico Bibbona; Mauro Gasparini

doi:10.1002/bimj.201700016

Biom J. 2018 Jan;60(1):146-154. doi: 10.1002/bimj.201700016. Epub 2017 Nov 7.

Authors

Luca Grassano¹, Guido Pagana²,³, Marco Daperno⁴, Enrico Bibbona¹, Mauro Gasparini¹

Affiliations

¹ Politecnico di Torino, Department of Mathematical Sciences, Torino, Italy.
² Politecnico di Torino, Department of Automatics and Informatics, Torino, Italy.
³ Istituto Superiore Mario Boella, Torino, Italy.
⁴ Ospedale Ordine Mauriziano di Torino Umberto I, Torino, Italy.

PMID: 29110316
DOI: 10.1002/bimj.201700016

Abstract

In clinical research and in more general classification problems, a frequent concern is the reliability of a rating system. In the absence of a gold standard, agreement may be considered as an indication of reliability. When dealing with categorical data, the well-known kappa statistic is often used to measure agreement. The aim of this paper is to obtain a theoretical result about the asymptotic distribution of the kappa statistic with multiple items, multiple raters, multiple conditions, and multiple rating categories (more than two), based on recent work. The result settles a long-standing search for the asymptotic variance of the kappa statistic in this situation and allows for the construction of asymptotic confidence intervals. A recent application to clinical endoscopy and to the diagnosis of inflammatory bowel diseases (IBDs) is briefly presented to complement the theoretical perspective.

Keywords: agreement; correlated kappa statistics; de Finetti representation theorem; inflammatory bowel diseases.
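
To make the setting in the abstract concrete, the following minimal Python sketch computes a multi-rater, multi-category kappa for each of two conditions and takes their difference. It assumes a Fleiss-type definition of kappa, which the abstract does not specify, and it does not reproduce the paper's asymptotic variance or confidence intervals. The function names `fleiss_kappa` and `kappa_difference` and the toy ratings are hypothetical, chosen only for illustration.

```python
from collections import Counter


def fleiss_kappa(ratings, categories):
    """Fleiss-type kappa (an assumed definition, not taken from the paper).

    ratings: list of items; each item is a list with one category label
             per rater (same number of raters for every item).
    categories: list of all possible category labels.
    """
    n_items = len(ratings)
    n_raters = len(ratings[0])
    counts = [Counter(item) for item in ratings]

    # Observed agreement: proportion of agreeing rater pairs, averaged over items.
    p_obs = sum(
        sum(c[k] * (c[k] - 1) for k in categories) / (n_raters * (n_raters - 1))
        for c in counts
    ) / n_items

    # Chance agreement from the pooled category proportions.
    p_cat = [sum(c[k] for c in counts) / (n_items * n_raters) for k in categories]
    p_exp = sum(p * p for p in p_cat)

    return (p_obs - p_exp) / (1 - p_exp)


def kappa_difference(ratings_a, ratings_b, categories):
    """Point estimate of the difference in kappa between two conditions."""
    return fleiss_kappa(ratings_a, categories) - fleiss_kappa(ratings_b, categories)


# Toy data (hypothetical): 3 items, 3 raters, 3 categories, two conditions.
cats = ["mild", "moderate", "severe"]
condition_a = [["mild"] * 3, ["moderate"] * 3, ["severe"] * 3]
condition_b = [
    ["mild", "mild", "moderate"],
    ["moderate", "moderate", "moderate"],
    ["severe", "moderate", "severe"],
]
diff = kappa_difference(condition_a, condition_b, cats)
```

Because the same items are typically rated under both conditions, the two kappa values are correlated. A confidence interval for their difference therefore needs a variance that accounts for this dependence, which is the kind of result the paper provides.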

MeSH terms

Biometry / methods*
Models, Statistical
Monte Carlo Method
Sample Size