A Dynamic Neural Field Model of Multimodal Merging: Application to the Ventriloquist Effect

Neural Comput. 2022 Jul 14;34(8):1701-1726. doi: 10.1162/neco_a_01509.

Abstract

Multimodal merging encompasses the ability to localize stimuli based on imprecise information sampled through individual senses such as sight and hearing. Merging decisions are typically described using Bayesian models that fit behaviors over many trials, encapsulated in a probability distribution. We introduce a novel computational model based on dynamic neural fields that simulates decision dynamics and generates localization decisions trial by trial, adapting to varying degrees of discrepancy between auditory and visual stimulation. Neural fields are commonly used to model neural processes at a mesoscopic scale, for instance neurophysiological activity in the superior colliculus. Our model is fit to human psychophysical data on the ventriloquist effect, additionally testing the influence of retinotopic projection onto the superior colliculus and providing a quantitative performance comparison to the Bayesian reference model. While both models perform equally well on average, a qualitative analysis of the free parameters of our model provides insight into the dynamics of the decision and the individual variations in perception caused by noise. Finally, we show that the increase in the number of free parameters does not result in overfitting and that the parameter space may be either reduced to fit specific criteria or exploited to perform well on more demanding tasks in the future. Indeed, beyond decision or localization tasks, our model opens the door to the simulation of behavioral dynamics, as well as saccade generation driven by multimodal stimulation.
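
The mechanism the abstract describes, a dynamic neural field that merges an auditory and a visual input into a single localization decision, can be illustrated with a short simulation. The Python sketch below is a minimal Amari-style field, not the authors' fitted model: the lateral kernel, time constant, resting level, and the stimulus positions, widths, and amplitudes are assumptions chosen for readability.

# Minimal sketch of a 1D dynamic neural field merging auditory and visual cues.
# Illustrative only: kernel shape, time constants, and stimulus parameters are
# assumptions, not the parameters fitted in the paper.
import numpy as np

# Spatial grid (degrees of azimuth) and simulation settings.
x = np.linspace(-40.0, 40.0, 401)
dx = x[1] - x[0]
dt, tau, steps = 1.0, 20.0, 300        # time step (ms), field time constant, iterations
h = -2.0                                # resting level, below firing threshold

# Lateral interaction kernel: local excitation, broader inhibition (assumed values).
def kernel(d, a_exc=2.0, s_exc=3.0, a_inh=1.0, s_inh=10.0):
    return (a_exc * np.exp(-d**2 / (2 * s_exc**2))
            - a_inh * np.exp(-d**2 / (2 * s_inh**2)))

W = kernel(x[:, None] - x[None, :])

# Audio-visual stimulation with a spatial discrepancy: the visual input is
# narrower (more precise) than the auditory one, so the merged activity bump
# is pulled toward the visual location, mimicking the ventriloquist bias.
def gaussian(center, width, amp):
    return amp * np.exp(-(x - center)**2 / (2 * width**2))

inp = gaussian(center=+5.0, width=2.0, amp=6.0)    # visual cue
inp += gaussian(center=-5.0, width=8.0, amp=4.0)   # auditory cue

u = np.full_like(x, h)                  # field activation
for _ in range(steps):
    f = 1.0 / (1.0 + np.exp(-4.0 * u))  # sigmoidal firing-rate function
    du = -u + h + inp + (W @ f) * dx    # Amari field dynamics
    u += dt / tau * du

# Read the localization decision as the position of the self-stabilized bump.
print(f"perceived location: {x[np.argmax(u)]:.1f} deg")

Adding noise to the inputs or to the field on each run would then produce the trial-by-trial variability in perceived location that the abstract refers to.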

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Bayes Theorem
  • Computer Simulation
  • Humans
  • Photic Stimulation
  • Probability
  • Superior Colliculi* / physiology
  • Visual Perception* / physiology