Human thalamic low-frequency oscillations correlate with expected value and outcomes during reinforcement learning

Antoine Collomb-Clerc; Maëlle C M Gueguen; Lorella Minotti; Philippe Kahane; Vincent Navarro; Fabrice Bartolomei; Romain Carron; Jean Regis; Stephan Chabardès; Stefano Palminteri; Julien Bastin

doi:10.1038/s41467-023-42380-6

Nat Commun. 2023 Oct 17;14(1):6534. doi: 10.1038/s41467-023-42380-6.

Authors

Antoine Collomb-Clerc¹, Maëlle C M Gueguen¹ ², Lorella Minotti¹ ³, Philippe Kahane¹ ³, Vincent Navarro⁴, Fabrice Bartolomei⁵ ⁶, Romain Carron⁶ ⁷, Jean Regis⁸, Stephan Chabardès¹ ⁹, Stefano Palminteri^#¹⁰, Julien Bastin^#¹¹

Affiliations

¹ Univ. Grenoble Alpes, Inserm, U1216, CHU Grenoble Alpes, Grenoble Institut Neurosciences, 38000, Grenoble, France.
² Department of Psychiatry, Brain Health Institute and University Behavioral Health Care, Rutgers University-New Brunswick, Piscataway, NJ, USA.
³ Neurology Department, University Hospital of Grenoble, Grenoble, France.
⁴ Sorbonne Université, Paris Brain Institute - Institut du Cerveau, ICM, INSERM, CNRS, AP-HP, Pitié-Salpêtrière Hospital, Paris, France.
⁵ Timone University Hospital, Sleep Unit, Epileptology and Cerebral Rhythmology, University Hospital of Marseille, Marseille, France.
⁶ Aix Marseille University, Inserm, Institut de Neurosciences des Systèmes, Marseille, France.
⁷ Timone University Hospital, Department of functional and stereotactic neurosurgery, University Hospital of Marseille, Marseille, France.
⁸ Neurosurgery Department, University Hospital of Marseille, Marseille, France.
⁹ Neurosurgery Department, University Hospital of Grenoble, Grenoble, France.
¹⁰ Laboratoire de Neurosciences Cognitives Computationnelles, Département d'Etudes Cognitives, ENS, PSL, INSERM, Paris, France.
¹¹ Univ. Grenoble Alpes, Inserm, U1216, CHU Grenoble Alpes, Grenoble Institut Neurosciences, 38000, Grenoble, France. julien.bastin@univ-grenoble-alpes.fr.

^# Contributed equally.

Abstract

Reinforcement-based adaptive decision-making is believed to recruit fronto-striatal circuits. A critical node of the fronto-striatal circuit is the thalamus. However, direct evidence of its involvement in human reinforcement learning is lacking. We address this gap by analyzing intra-thalamic electrophysiological recordings from eight participants while they performed a reinforcement learning task. We found that in both the anterior thalamus (ATN) and dorsomedial thalamus (DMTN), low-frequency oscillations (LFO, 4-12 Hz) correlated positively with expected value estimated from computational modeling during reward-based learning (after outcome delivery) or punishment-based learning (during the choice process). Furthermore, LFO recorded from ATN/DMTN were also negatively correlated with outcomes, so that both components of reward prediction errors were signaled in the human thalamus. The observed differences in the prediction signals between rewarding and punishing conditions shed light on the neural mechanisms underlying action inhibition in punishment avoidance learning. Our results provide insight into the role of the thalamus in reinforcement-based decision-making in humans.
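
For readers unfamiliar with the terminology, the two components of a reward prediction error (expected value and outcome) can be illustrated with a minimal sketch. It assumes a standard delta-rule (Rescorla-Wagner / Q-learning) learner with softmax choice on a two-option task; the abstract does not state which model the authors fitted, so the function name, learning rate, temperature, and reward probabilities below are hypothetical and purely illustrative.

```python
import math
import random


def simulate_q_learning(n_trials=100, alpha=0.3, beta=3.0,
                        p_reward=(0.75, 0.25), seed=0):
    """Simulate a two-option reward task with a delta-rule learner.

    All parameter values are illustrative assumptions, not values
    taken from the study.
    """
    rng = random.Random(seed)
    q = [0.0, 0.0]  # learned value of each option
    trials = []
    for _ in range(n_trials):
        # Softmax choice between the two options (beta = inverse temperature).
        p_choose_0 = 1.0 / (1.0 + math.exp(-beta * (q[0] - q[1])))
        choice = 0 if rng.random() < p_choose_0 else 1

        # Probabilistic binary outcome for the chosen option.
        outcome = 1.0 if rng.random() < p_reward[choice] else 0.0

        # The two components of the prediction error.
        expected_value = q[choice]
        prediction_error = outcome - expected_value

        # Delta-rule update of the chosen option's value.
        q[choice] += alpha * prediction_error
        trials.append((choice, expected_value, outcome, prediction_error))
    return trials
```

Under this assumed model, each trial yields an expected value and an outcome whose difference is the prediction error; trial-wise regressors of this kind are what a neural signal would be correlated against.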

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Avoidance Learning / physiology
Humans
Punishment
Reinforcement, Psychology*
Reward*
Thalamus

Associated data

figshare/10.6084/m9.figshare.23659896