Enhanced Equivalence Projective Simulation: A Framework for Modeling Formation of Stimulus Equivalence Classes

Asieh Abolpou Mofrad; Anis Yazidi; Samaneh Abolpour Mofrad; Hugo L Hammer; Erik Arntzen

doi:10.1162/neco_a_01346

Enhanced Equivalence Projective Simulation: A Framework for Modeling Formation of Stimulus Equivalence Classes

Neural Comput. 2021 Feb;33(2):483-527. doi: 10.1162/neco_a_01346. Epub 2020 Nov 30.

Authors

Asieh Abolpou Mofrad¹, Anis Yazidi², Samaneh Abolpour Mofrad³, Hugo L Hammer⁴, Erik Arntzen⁵

Affiliations

¹ Department of Computer Science, Oslo Metropolitan University, 0130 Oslo, Norway asieh.abolpour-mofrad@oslomet.no.
² Department of Computer Science, Oslo Metropolitan University, 0130 Oslo, Norway Anis.Yazidi@oslomet.no.
³ Department of Computer Science, Electrical Engineering, and Mathematical Sciences, Western Norway University of Applied Sciences, 5063 Bergen, Norway, and Mohn Medical Imaging and Visualization Center, Department of Radiology, Haukeland University Hospital, 5021 Bergen, Norway Samaneh.Abolpour.Mofrad@hvl.no.
⁴ Department of Computer Science, Oslo Metropolitan University, 0130 Oslo, Norway, and Simula Metropolitan Center, 1325 Oslo, Norway Hugo.Hammer@oslomet.no.
⁵ Department of Behavioral Science, Oslo Metropolitan University, 0130 Oslo, Norway erik.arntzen@equivalence.net.

PMID: 33253033
DOI: 10.1162/neco_a_01346

Abstract

Formation of stimulus equivalence classes has been recently modeled through equivalence projective simulation (EPS), a modified version of a projective simulation (PS) learning agent. PS is endowed with an episodic memory that resembles the internal representation in the brain and the concept of cognitive maps. PS flexibility and interpretability enable the EPS model and, consequently the model we explore in this letter, to simulate a broad range of behaviors in matching-to-sample experiments. The episodic memory, the basis for agent decision making, is formed during the training phase. Derived relations in the EPS model that are not trained directly but can be established via the network's connections are computed on demand during the test phase trials by likelihood reasoning. In this letter, we investigate the formation of derived relations in the EPS model using network enhancement (NE), an iterative diffusion process, that yields an offline approach to the agent decision making at the testing phase. The NE process is applied after the training phase to denoise the memory network so that derived relations are formed in the memory network and retrieved during the testing phase. During the NE phase, indirect relations are enhanced, and the structure of episodic memory changes. This approach can also be interpreted as the agent's replay after the training phase, which is in line with recent findings in behavioral and neuroscience studies. In comparison with EPS, our model is able to model the formation of derived relations and other features such as the nodal effect in a more intrinsic manner. Decision making in the test phase is not an ad hoc computational method, but rather a retrieval and update process of the cached relations from the memory network based on the test trial. In order to study the role of parameters on agent performance, the proposed model is simulated and the results discussed through various experimental settings.