Repeated computerized cognitive testing: Performance shifts and test-retest reliability in healthy older adults

Naomi White; Larnee Flannery; Alice McClintock; Liana Machado

doi:10.1080/13803395.2018.1526888

Repeated computerized cognitive testing: Performance shifts and test-retest reliability in healthy older adults

J Clin Exp Neuropsychol. 2019 Mar;41(2):179-191. doi: 10.1080/13803395.2018.1526888. Epub 2018 Oct 15.

Authors

Naomi White^{1

2}, Larnee Flannery¹, Alice McClintock¹, Liana Machado^{1

2}

Affiliations

¹ a Department of Psychology and Brain Health Research Centre , University of Otago , Dunedin , New Zealand.
² b Brain Research New Zealand , Dunedin , New Zealand.

PMID: 30320531
DOI: 10.1080/13803395.2018.1526888

Abstract

Introduction: Repeated cognitive assessment is frequently required to monitor changes in cognitive functioning in older adults, but studies of repeated computerized testing in this population are scarce. To provide new insight into retest effects this study examined within- and between-day performance shifts and test-retest reliability among healthy older adults for test scores from a computerized cognitive battery. Method: Thirty older men (65-71 years) completed the battery six times. Testing occurred twice on each of three testing days, separated by 1 week. Results: Reaction times (RTs) on tasks intended to measure inhibition (Anti), response switching (Pro/Anti), selective attention (Simon and Flanker), and working memory (2-back) typically showed practice effects, which were most prominent between the first two time points. In most cases, these RTs showed moderate to good test-retest reliability (intraclass correlation coefficient, ICC, range = .34 to .93) with lower reliability between the first two time points. Two-back accuracy rates showed similar results. In contrast, RTs on a basic visuomotor task (Pro) and on compatible trials of the Simon task showed increases at later time points, presumably because of boredom, but demonstrated mostly moderate to good reliability (ICC range = .49 to .83). Scoring metrics from a computerized version of the Corsi Block-Tapping task (intended to measure short-term and working memory) and cost scores (performance differences between two related conditions/tasks) intended to isolate specific cognitive functions tended to show poor reliability (ICC range = -.23 to .62). Conclusions: Most of the RT tasks investigated showed suitability for use in repeated testing among older adults, although longer familiarization periods appear to be warranted in many cases to reduce practice effects and improve initial reliability. However, poor reliability indicated that scoring metrics from the computerized Corsi Block-Tapping task and cost scores are unsuitable for repeated testing.

Keywords: Cognition; cost scores; executive functions; neuropsychological assessment; practice effects; repeat testing; retest stability.

Publication types

Research Support, Non-U.S. Gov't
Validation Study

MeSH terms

Aged
Attention
Cognition
Cognition Disorders / diagnosis*
Cognition Disorders / psychology
Health Status
Humans
Inhibition, Psychological
Male
Memory, Short-Term
Neuropsychological Tests / statistics & numerical data*
Psychometrics / statistics & numerical data*
Psychomotor Performance
Reaction Time
Reproducibility of Results
Software*