Reward gain model describes cortical use-dependent plasticity

Firas Mawase; Nicholas Wymbs; Shintaro Uehara; Pablo Celnik

doi:10.1109/EMBC.2016.7590626

Reward gain model describes cortical use-dependent plasticity

Annu Int Conf IEEE Eng Med Biol Soc. 2016 Aug:2016:5-8. doi: 10.1109/EMBC.2016.7590626.

Authors

Firas Mawase, Nicholas Wymbs, Shintaro Uehara, Pablo Celnik

PMID: 28268267
DOI: 10.1109/EMBC.2016.7590626

Abstract

Consistent repetitions of an action lead to plastic change in the motor cortex and cause shift in the direction of future movements. This process is known as use-dependent plasticity (UDP), one of the basic forms of the motor memory. We have recently demonstrated in a physiological study that success-related reinforcement signals could modulate the strength of UDP. We tested this idea by developing a computational approach that modeled the shift in the direction of future action as a change in preferred direction of population activity of neurons in the primary motor cortex. The rate of the change follows a modified temporal difference reinforcement learning algorithm, in which the learning policy is based on comparison between what reward the population experiences on a particular trial, and what it had expected on the basis of its previous learning. By using this model, we were able to characterize the nature of learning and retention of UDP. Exploring the relationship between reinforcement and UDP constitutes a crucial step toward understanding the basic blocks involved in the formation of motor memories.

MeSH terms

Adult
Algorithms
Computer Simulation
Female
Humans
Male
Models, Neurological*
Motor Cortex / physiology
Neuronal Plasticity / physiology*
Neurons / physiology
Nontherapeutic Human Experimentation
Reinforcement, Psychology
Reward
Thumb / physiology

Grants and funding

R01 HD073147/HD/NICHD NIH HHS/United States