Asymptotic distributions of kappa statistics and their differences with many raters, many rating categories and two conditions

Biom J. 2018 Jan;60(1):146-154. doi: 10.1002/bimj.201700016. Epub 2017 Nov 7.

Abstract

In clinical research and in more general classification problems, a frequent concern is the reliability of a rating system. In the absence of a gold standard, agreement may be considered as an indication of reliability. When dealing with categorical data, the well-known kappa statistic is often used to measure agreement. The aim of this paper is to obtain a theoretical result about the asymptotic distribution of the kappa statistic with multiple items, multiple raters, multiple conditions, and multiple rating categories (more than two), based on recent work. The result settles a long lasting quest for the asymptotic variance of the kappa statistic in this situation and allows for the construction of asymptotic confidence intervals. A recent application to clinical endoscopy and to the diagnosis of inflammatory bowel diseases (IBDs) is shortly presented to complement the theoretical perspective.

Keywords: agreement; correlated kappa statistics; de Finetti representation theorem; inflammatory bowel diseases.

MeSH terms

  • Biometry / methods*
  • Models, Statistical
  • Monte Carlo Method
  • Sample Size