Dissociating the contributions of reward-prediction errors to trial-level adaptation and long-term learning

Biol Psychol. 2020 Jan:149:107775. doi: 10.1016/j.biopsycho.2019.107775. Epub 2019 Sep 26.

Abstract

Reward positivity (RewP) is an EEG component reflecting reward-prediction errors. Using multilevel models, we measured single-trial RewP amplitude from trial-to-trial, while reward and prediction varied during learning. Sixty participants completed a category-learning task in either engaging or sterile conditions with the RewP time-locked to feedback. Sequential analysis of single-trial RewP showed its relationship to current and previous accuracy, and the probability of changing one's response to subsequent stimuli. Simulations show these effects can be explained in detail by the dynamics of participants' expectations according to principles of reinforcement learning. The single-trial RewP findings were consistent with previous literature linking RewP to reward-prediction error under reinforcement-learning theory. In contrast, the aggregate RewP was unrelated to the engagement manipulation or to delayed retention performance. Thus the present results provide a detailed computational account how RewP relates to acute adaptation, but suggest RewP plays little role in long-term learning.

Keywords: Adaptation; EEG; Reinforcement learning; RewP.

Publication types

  • Clinical Trial
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Adaptation, Psychological / physiology*
  • Adult
  • Electroencephalography
  • Evoked Potentials / physiology
  • Female
  • Humans
  • Learning / physiology*
  • Male
  • Multilevel Analysis
  • Reinforcement, Psychology*
  • Reward*
  • Task Performance and Analysis
  • Young Adult