Discovering Implied Serial Order Through Model-Free and Model-Based Learning

Front Neurosci. 2019 Aug 20:13:878. doi: 10.3389/fnins.2019.00878. eCollection 2019.

Abstract

Humans and animals can learn to order a list of items without relying on explicit spatial or temporal cues. To do so, they appear to make use of transitivity, a property of all ordered sets. Here, we summarize relevant research on the transitive inference (TI) paradigm and its relationship to learning the underlying order of an arbitrary set of items. We compare six computational models of TI performance, three of which are model-free (Q-learning, Value Transfer, and REMERGE) and three of which are model-based (RL-Elo, Sequential Monte Carlo, and Betasort). Our goal is to assess the ability of these models to produce empirically observed features of TI behavior. Model-based approaches perform better under a wider range of scenarios, but no single model explains the full scope of behaviors reported in the TI literature.

Keywords: cognitive maps; model-based learning; model-free learning; reinforcement learning; transitive inference.

Publication types

  • Review

Associated data

  • figshare/10.6084/m9.figshare.7992005.v1