Mesolimbic dopamine adapts the rate of learning from action

Nature. 2023 Feb;614(7947):294-302. doi: 10.1038/s41586-022-05614-z. Epub 2023 Jan 18.

Abstract

Recent success in training artificial agents and robots derives from a combination of direct learning of behavioural policies and indirect learning through value functions [1-3]. Policy learning and value learning use distinct algorithms that optimize behavioural performance and reward prediction, respectively. In animals, behavioural learning and the role of mesolimbic dopamine signalling have been extensively evaluated with respect to reward prediction [4]; however, so far there has been little consideration of how direct policy learning might inform our understanding [5]. Here we used a comprehensive dataset of orofacial and body movements to understand how behavioural policies evolved as naive, head-restrained mice learned a trace conditioning paradigm. Individual differences in initial dopaminergic reward responses correlated with the emergence of a learned behavioural policy, but not the emergence of putative value encoding for a predictive cue. Likewise, physiologically calibrated manipulations of mesolimbic dopamine produced several effects inconsistent with value learning but predicted by a neural-network-based model that used dopamine signals to set an adaptive rate, not an error signal, for behavioural policy learning. This work provides strong evidence that phasic dopamine activity can regulate direct learning of behavioural policies, expanding the explanatory power of reinforcement learning models for animal learning [6].
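The abstract's central contrast — dopamine as an *error signal* that signs and scales value updates, versus dopamine as an *adaptive rate* that only gates how fast a policy gradient is applied — can be illustrated with a minimal sketch. This is not the authors' neural-network model; both functions, their names, and all parameters are illustrative stand-ins for the two algorithm families the abstract names (value learning and direct policy learning).

```python
import math
import random

def value_learner(rewards, alpha=0.1):
    """Rescorla-Wagner / TD-style value learning: a dopamine-like
    prediction error delta = r - V is the ERROR SIGNAL -- it both
    signs and scales the update to the value estimate V."""
    V = 0.0
    for r in rewards:
        delta = r - V       # reward prediction error
        V += alpha * delta  # error signal drives the update directly
    return V

def policy_learner_adaptive_rate(trials, base_lr=0.05, seed=0):
    """REINFORCE-style policy learning where a dopamine-like signal sets
    the learning RATE (non-negative) rather than the error: the update's
    direction comes from the policy gradient itself; the reward signal
    only modulates how quickly that gradient is applied."""
    theta = 0.0  # preference for the 'lick' action (hypothetical task)
    rng = random.Random(seed)
    for _ in range(trials):
        p_lick = 1.0 / (1.0 + math.exp(-theta))
        lick = rng.random() < p_lick
        r = 1.0 if lick else 0.0         # licking at the cue earns reward
        rate = base_lr * (1.0 + r)       # dopamine-scaled, always >= 0
        grad = (1.0 - p_lick) if lick else -p_lick  # d log pi / d theta
        theta += rate * grad * r         # rate gates speed, not direction
    return 1.0 / (1.0 + math.exp(-theta))  # final P(lick)
```

In the value learner, halving or inverting the dopamine-like term changes *what* is learned; in the policy learner, scaling it only changes *how fast* the behavioural policy emerges — the asymmetry the manipulation experiments in the abstract are designed to distinguish.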

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Animals
  • Behavior, Animal*
  • Conditioning, Psychological
  • Cues
  • Datasets as Topic
  • Dopamine* / metabolism
  • Head
  • Learning*
  • Mice
  • Movement
  • Neural Networks, Computer
  • Neural Pathways*
  • Reinforcement, Psychology*
  • Reward

Substances

  • Dopamine