Dissociating the contributions of reward-prediction errors to trial-level adaptation and long-term learning

K R Lohse; M W Miller; M Daou; W Valerius; M Jones

doi:10.1016/j.biopsycho.2019.107775

Biol Psychol. 2020 Jan:149:107775. doi: 10.1016/j.biopsycho.2019.107775. Epub 2019 Sep 26.

Authors

K R Lohse¹, M W Miller², M Daou³, W Valerius⁴, M Jones⁵

Affiliations

¹ University of Utah, Department of Health, Kinesiology, and Recreation, United States; University of Utah, Department of Physical Therapy and Athletic Training, United States. Electronic address: rehabinformatics@gmail.com.
² Auburn University, School of Kinesiology, United States; Auburn University, Center for Neuroscience, United States.
³ Auburn University, School of Kinesiology, United States; Coastal Carolina University, Department of Kinesiology, United States.
⁴ Auburn University, School of Kinesiology, United States.
⁵ University of Colorado, Department of Psychology and Neuroscience, United States.

PMID: 31563586
DOI: 10.1016/j.biopsycho.2019.107775

Abstract

Reward positivity (RewP) is an EEG component reflecting reward-prediction errors. Using multilevel models, we measured single-trial RewP amplitude from trial to trial while reward and prediction varied during learning. Sixty participants completed a category-learning task in either engaging or sterile conditions, with the RewP time-locked to feedback. Sequential analysis of single-trial RewP showed its relationship to current and previous accuracy and to the probability of changing one's response to subsequent stimuli. Simulations show that these effects can be explained in detail by the dynamics of participants' expectations according to principles of reinforcement learning. The single-trial RewP findings were consistent with previous literature linking RewP to reward-prediction error under reinforcement-learning theory. In contrast, the aggregate RewP was unrelated to the engagement manipulation or to delayed retention performance. Thus, the present results provide a detailed computational account of how RewP relates to acute adaptation but suggest that RewP plays little role in long-term learning.
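
The link between prediction error and current and previous accuracy can be illustrated with a minimal sketch. It assumes a simple delta-rule (Rescorla-Wagner style) learner, which is one common reinforcement-learning formulation and not necessarily the model used in the study's simulations. The function name, the learning rate of 0.2, and the initial expectation of 0.5 are illustrative assumptions.

```python
def simulate_prediction_errors(outcomes, learning_rate=0.2, initial_expectation=0.5):
    """Return the reward-prediction error on each trial under a delta rule.

    outcomes: sequence of rewards (1 = correct feedback, 0 = incorrect feedback).
    learning_rate, initial_expectation: illustrative values, not taken from the study.
    """
    expectation = initial_expectation
    errors = []
    for reward in outcomes:
        delta = reward - expectation          # prediction error on this trial
        errors.append(delta)
        expectation += learning_rate * delta  # expectation shifts toward the outcome
    return errors


# Three correct trials, one error, then a correct trial (hypothetical sequence).
outcomes = [1, 1, 1, 0, 1]
errors = simulate_prediction_errors(outcomes)
for trial, (reward, delta) in enumerate(zip(outcomes, errors), start=1):
    print(f"trial {trial}: reward={reward}, prediction error={delta:+.3f}")
```

In this sketch, positive prediction errors shrink across a run of correct trials as the expectation rises, and a correct trial that follows an error yields a larger positive prediction error than one that follows a run of correct trials. That is the kind of dependence on current and previous accuracy the abstract describes, under the stated assumptions.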

Keywords: Adaptation; EEG; Reinforcement learning; RewP.

Publication types

Clinical Trial
Research Support, U.S. Gov't, P.H.S.

MeSH terms

Adaptation, Psychological / physiology*
Adult
Electroencephalography
Evoked Potentials / physiology
Female
Humans
Learning / physiology*
Male
Multilevel Analysis
Reinforcement, Psychology*
Reward*
Task Performance and Analysis
Young Adult