Discovering Implied Serial Order Through Model-Free and Model-Based Learning

Greg Jensen; Herbert S Terrace; Vincent P Ferrera

doi:10.3389/fnins.2019.00878

Discovering Implied Serial Order Through Model-Free and Model-Based Learning

Front Neurosci. 2019 Aug 20:13:878. doi: 10.3389/fnins.2019.00878. eCollection 2019.

Authors

Greg Jensen^{1

2}, Herbert S Terrace^{1

3}, Vincent P Ferrera^{2

3}

Affiliations

¹ Department of Psychology, Columbia University, New York, NY, United States.
² Department of Neuroscience, Columbia University, New York, NY, United States.
³ Department of Psychiatry, Columbia University, New York, NY, United States.

Abstract

Humans and animals can learn to order a list of items without relying on explicit spatial or temporal cues. To do so, they appear to make use of transitivity, a property of all ordered sets. Here, we summarize relevant research on the transitive inference (TI) paradigm and its relationship to learning the underlying order of an arbitrary set of items. We compare six computational models of TI performance, three of which are model-free (Q-learning, Value Transfer, and REMERGE) and three of which are model-based (RL-Elo, Sequential Monte Carlo, and Betasort). Our goal is to assess the ability of these models to produce empirically observed features of TI behavior. Model-based approaches perform better under a wider range of scenarios, but no single model explains the full scope of behaviors reported in the TI literature.

Keywords: cognitive maps; model-based learning; model-free learning; reinforcement learning; transitive inference.

Discovering Implied Serial Order Through Model-Free and Model-Based Learning

Authors

Affiliations

Abstract

Publication types

Associated data

Grants and funding