Learning in Volatile Environments With the Bayes Factor Surprise

Vasiliki Liakoni; Alireza Modirshanechi; Wulfram Gerstner; Johanni Brea

doi:10.1162/neco_a_01352

Neural Comput. 2021 Feb;33(2):269-340. doi: 10.1162/neco_a_01352. Epub 2021 Jan 5.

Authors

Vasiliki Liakoni¹, Alireza Modirshanechi², Wulfram Gerstner³, Johanni Brea⁴

Affiliations

¹ École Polytechnique Fédérale de Lausanne, School of Computer and Communication Sciences and School of Life Sciences, 1015 Lausanne, Switzerland. Email: vasiliki.liakoni@epfl.ch
² École Polytechnique Fédérale de Lausanne, School of Computer and Communication Sciences and School of Life Sciences, 1015 Lausanne, Switzerland. Email: alireza.modirshanechi@epfl.ch
³ École Polytechnique Fédérale de Lausanne, School of Computer and Communication Sciences and School of Life Sciences, 1015 Lausanne, Switzerland. Email: wulfram.gerstner@epfl.ch
⁴ École Polytechnique Fédérale de Lausanne, School of Computer and Communication Sciences and School of Life Sciences, 1015 Lausanne, Switzerland. Email: johanni.brea@epfl.ch

PMID: 33400898
DOI: 10.1162/neco_a_01352

Abstract

Surprise-based learning allows agents to rapidly adapt to nonstationary stochastic environments characterized by sudden changes. We show that exact Bayesian inference in a hierarchical model gives rise to a surprise-modulated trade-off between forgetting old observations and integrating them with the new ones. The modulation depends on a probability ratio, which we call the Bayes Factor Surprise, that tests the prior belief against the current belief. We demonstrate that in several existing approximate algorithms, the Bayes Factor Surprise modulates the rate of adaptation to new observations. We derive three novel surprise-based algorithms, one in the family of particle filters, one in the family of variational learning, and one in the family of message passing, that have constant scaling in observation sequence length and particularly simple update dynamics for any distribution in the exponential family. Empirical results show that these surprise-based algorithms estimate parameters better than alternative approximate approaches and reach levels of performance comparable to computationally more expensive algorithms. The Bayes Factor Surprise is related to but different from the Shannon Surprise. In two hypothetical experiments, we make testable predictions for physiological indicators that dissociate the Bayes Factor Surprise from the Shannon Surprise. The theoretical insight of casting various approaches as surprise-based learning, as well as the proposed online algorithms, may be applied to the analysis of animal and human behavior and to reinforcement learning in nonstationary environments.
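
The surprise-modulated trade-off described in the abstract can be illustrated with a minimal sketch. The abstract gives no formulas, so everything below is an assumption made for illustration: the surprise is taken as the ratio of the observation's marginal probability under the prior belief to that under the current belief, the adaptation rate is taken as gamma = m*S / (1 + m*S) with m = p_c / (1 - p_c) for a hypothetical change probability p_c, and the new belief interpolates the natural parameters of an "integrate" posterior and a "reset" posterior. The Gaussian model with known observation variance and all function names are hypothetical; this is not one of the three algorithms the authors derive.

```python
import math


def log_gaussian_pdf(x, mean, var):
    """Log density of a Gaussian N(mean, var) evaluated at x."""
    return -0.5 * (x - mean) ** 2 / var - 0.5 * math.log(2.0 * math.pi * var)


def log_bayes_factor_surprise(y, belief, prior, obs_var):
    """Log of P(y; prior belief) / P(y; current belief).

    Beliefs are (mean, variance) pairs over the unknown Gaussian mean;
    the marginal of y under a belief is N(mean, variance + obs_var).
    """
    mu, var = belief
    mu0, var0 = prior
    return (log_gaussian_pdf(y, mu0, var0 + obs_var)
            - log_gaussian_pdf(y, mu, var + obs_var))


def adaptation_rate(log_surprise, change_prob):
    """gamma = m*S / (1 + m*S), computed stably as a logistic function."""
    x = math.log(change_prob / (1.0 - change_prob)) + log_surprise
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def conjugate_update(belief, y, obs_var):
    """Standard Gaussian posterior over the mean after observing y."""
    mu, var = belief
    precision = 1.0 / var + 1.0 / obs_var
    return (mu / var + y / obs_var) / precision, 1.0 / precision


def surprise_modulated_update(belief, prior, y, obs_var, change_prob):
    """Blend 'integrate' and 'reset' posteriors, weighted by gamma."""
    log_s = log_bayes_factor_surprise(y, belief, prior, obs_var)
    gamma = adaptation_rate(log_s, change_prob)
    mu_i, var_i = conjugate_update(belief, y, obs_var)  # integrate with old data
    mu_r, var_r = conjugate_update(prior, y, obs_var)   # forget old data
    # Interpolate natural parameters (precision and precision-weighted mean).
    precision = (1.0 - gamma) / var_i + gamma / var_r
    eta = (1.0 - gamma) * mu_i / var_i + gamma * mu_r / var_r
    return (eta / precision, 1.0 / precision), gamma


if __name__ == "__main__":
    prior = (0.0, 100.0)    # broad prior over the mean
    belief = (0.0, 0.01)    # confident current belief
    for y in (0.1, 10.0):   # an expected and an unexpected observation
        new_belief, gamma = surprise_modulated_update(belief, prior, y, 1.0, 0.05)
        print(f"y={y}: gamma={gamma:.4f}, new mean={new_belief[0]:.3f}")
```

In this sketch an observation that the current belief explains well yields a small gamma, so the belief is refined by integration, while an observation far better explained by the prior pushes gamma toward 1, so the old observations are effectively forgotten.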

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Algorithms*
Animals
Bayes Theorem
Behavior / physiology*
Computer Simulation*
Humans
Learning / physiology*
Reinforcement, Psychology*