Discovery of hierarchical representations for efficient planning

Momchil S Tomov; Samyukta Yagati; Agni Kumar; Wanqian Yang; Samuel J Gershman

doi:10.1371/journal.pcbi.1007594

Discovery of hierarchical representations for efficient planning

PLoS Comput Biol. 2020 Apr 6;16(4):e1007594. doi: 10.1371/journal.pcbi.1007594. eCollection 2020 Apr.

Authors

Momchil S Tomov^{1

2}, Samyukta Yagati³, Agni Kumar³, Wanqian Yang⁴, Samuel J Gershman²

Affiliations

¹ Program in Neuroscience, Harvard Medical School, Boston, Massachusetts, United States of America.
² Department of Psychology and Center for Brain Science, Harvard University, Cambridge, Massachusetts, United States of America.
³ Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America.
⁴ School of Engineering and Applied Sciences, Harvard University, Cambridge, Massachusetts, United States of America.

Abstract

We propose that humans spontaneously organize environments into clusters of states that support hierarchical planning, enabling them to tackle challenging problems by breaking them down into sub-problems at various levels of abstraction. People constantly rely on such hierarchical presentations to accomplish tasks big and small-from planning one's day, to organizing a wedding, to getting a PhD-often succeeding on the very first attempt. We formalize a Bayesian model of hierarchy discovery that explains how humans discover such useful abstractions. Building on principles developed in structure learning and robotics, the model predicts that hierarchy discovery should be sensitive to the topological structure, reward distribution, and distribution of tasks in the environment. In five simulations, we show that the model accounts for previously reported effects of environment structure on planning behavior, such as detection of bottleneck states and transitions. We then test the novel predictions of the model in eight behavioral experiments, demonstrating how the distribution of tasks and rewards can influence planning behavior via the discovered hierarchy, sometimes facilitating and sometimes hindering performance. We find evidence that the hierarchy discovery process unfolds incrementally across trials. Finally, we propose how hierarchy discovery and hierarchical planning might be implemented in the brain. Together, these findings present an important advance in our understanding of how the brain might use Bayesian inference to discover and exploit the hidden hierarchical structure of the environment.

Publication types

Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't
Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

Algorithms
Bayes Theorem*
Brain / physiology*
Computer Simulation
Female
Humans
Learning / physiology*
Male
Markov Chains
Models, Neurological
Monte Carlo Method
Reward
Video Games

Grants and funding

R01 MH109177/MH/NIMH NIH HHS/United States