Simulating bout-and-pause patterns with reinforcement learning

Kota Yamada; Atsunori Kanemura

doi:10.1371/journal.pone.0242201

Simulating bout-and-pause patterns with reinforcement learning

PLoS One. 2020 Nov 12;15(11):e0242201. doi: 10.1371/journal.pone.0242201. eCollection 2020.

Authors

Kota Yamada^{1

2}, Atsunori Kanemura²

Affiliations

¹ Keio University, Tokyo, Japan.
² LeapMind Inc. Tokyo, Japan.

Abstract

Animal responses occur according to a specific temporal structure composed of two states, where a bout is followed by a long pause until the next bout. Such a bout-and-pause pattern has three components: the bout length, the within-bout response rate, and the bout initiation rate. Previous studies have investigated how these three components are affected by experimental manipulations. However, it remains unknown what underlying mechanisms cause bout-and-pause patterns. In this article, we propose two mechanisms and examine computational models developed based on reinforcement learning. The model is characterized by two mechanisms. The first mechanism is choice-an agent makes a choice between operant and other behaviors. The second mechanism is cost-a cost is associated with the changeover of behaviors. These two mechanisms are extracted from past experimental findings. Simulation results suggested that both the choice and cost mechanisms are required to generate bout-and-pause patterns and if either of them is knocked out, the model does not generate bout-and-pause patterns. We further analyzed the proposed model and found that it reproduced the relationships between experimental manipulations and the three components that have been reported by previous studies. In addition, we showed alternative models can generate bout-and-pause patterns as long as they implement the two mechanisms.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Animals
Choice Behavior
Models, Neurological*
Reinforcement, Psychology*

Grants and funding

This study was supported in part by Grant-in-Aid for JSPS Fellows (20J21568) to KY from the Japan Society for the Promotion of Science (http://www.jsps.go.jp/english/e-grants). The funder had no role in study design, data collection, data analysis, and preparation of the manuscript. KY and AK are employed by and receive salaries from LeapMind Inc. (https://leapmind.io/en/), and the both authors played roles in the study design, data collection and analysis, decision to publish, or preparation of the manuscript. The specific roles of these authors are articulated in the ‘author contributions’ section. There was no additional external funding received for this study.