Generative Models for Active Vision

Thomas Parr; Noor Sajid; Lancelot Da Costa; M Berk Mirza; Karl J Friston

doi:10.3389/fnbot.2021.651432

Generative Models for Active Vision

Front Neurorobot. 2021 Apr 13:15:651432. doi: 10.3389/fnbot.2021.651432. eCollection 2021.

Authors

Thomas Parr¹, Noor Sajid¹, Lancelot Da Costa^{1

2}, M Berk Mirza³, Karl J Friston¹

Affiliations

¹ Wellcome Centre for Human Neuroimaging, Queen Square Institute of Neurology, London, United Kingdom.
² Department of Mathematics, Imperial College London, London, United Kingdom.
³ Department of Neuroimaging, Centre for Neuroimaging Sciences, Institute of Psychiatry, Psychology & Neuroscience, King's College London, London, United Kingdom.

Abstract

The active visual system comprises the visual cortices, cerebral attention networks, and oculomotor system. While fascinating in its own right, it is also an important model for sensorimotor networks in general. A prominent approach to studying this system is active inference-which assumes the brain makes use of an internal (generative) model to predict proprioceptive and visual input. This approach treats action as ensuring sensations conform to predictions (i.e., by moving the eyes) and posits that visual percepts are the consequence of updating predictions to conform to sensations. Under active inference, the challenge is to identify the form of the generative model that makes these predictions-and thus directs behavior. In this paper, we provide an overview of the generative models that the brain must employ to engage in active vision. This means specifying the processes that explain retinal cell activity and proprioceptive information from oculomotor muscle fibers. In addition to the mechanics of the eyes and retina, these processes include our choices about where to move our eyes. These decisions rest upon beliefs about salient locations, or the potential for information gain and belief-updating. A key theme of this paper is the relationship between "looking" and "seeing" under the brain's implicit generative model of the visual world.

Keywords: Bayesian; active vision; attention; generative model; inference; oculomotion.

Grants and funding

WT_/Wellcome Trust/United Kingdom