Hedging your bets by learning reward correlations in the human brain

Klaus Wunderlich; Mkael Symmonds; Peter Bossaerts; Raymond J Dolan

doi:10.1016/j.neuron.2011.07.025

Hedging your bets by learning reward correlations in the human brain

Neuron. 2011 Sep 22;71(6):1141-52. doi: 10.1016/j.neuron.2011.07.025. Epub 2011 Sep 21.

Authors

Klaus Wunderlich¹, Mkael Symmonds, Peter Bossaerts, Raymond J Dolan

Affiliation

¹ Wellcome Trust Center for Neuroimaging, University College London, London WC1N 3BG, UK. k.wunderlich@ucl.ac.uk

Abstract

Human subjects are proficient at tracking the mean and variance of rewards and updating these via prediction errors. Here, we addressed whether humans can also learn about higher-order relationships between distinct environmental outcomes, a defining ecological feature of contexts where multiple sources of rewards are available. By manipulating the degree to which distinct outcomes are correlated, we show that subjects implemented an explicit model-based strategy to learn the associated outcome correlations and were adept in using that information to dynamically adjust their choices in a task that required a minimization of outcome variance. Importantly, the experimentally generated outcome correlations were explicitly represented neuronally in right midinsula with a learning prediction error signal expressed in rostral anterior cingulate cortex. Thus, our data show that the human brain represents higher-order correlation structures between rewards, a core adaptive ability whose immediate benefit is optimized sampling.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Adolescent
Adult
Brain / anatomy & histology
Brain / physiology*
Choice Behavior
Environment
Female
Humans
Learning / physiology*
Magnetic Resonance Imaging
Male
Models, Neurological*
Neuropsychological Tests
Reward*
Young Adult

Grants and funding

Wellcome Trust/United Kingdom