Recurrence is required to capture the representational dynamics of the human visual system

Tim C Kietzmann; Courtney J Spoerer; Lynn K A Sörensen; Radoslaw M Cichy; Olaf Hauk; Nikolaus Kriegeskorte

doi:10.1073/pnas.1905544116

Recurrence is required to capture the representational dynamics of the human visual system

Proc Natl Acad Sci U S A. 2019 Oct 22;116(43):21854-21863. doi: 10.1073/pnas.1905544116. Epub 2019 Oct 7.

Authors

Tim C Kietzmann^{1

2}, Courtney J Spoerer³, Lynn K A Sörensen⁴, Radoslaw M Cichy⁵, Olaf Hauk³, Nikolaus Kriegeskorte⁶

Affiliations

¹ MRC Cognition and Brain Sciences Unit, University of Cambridge, Cambridge CB2 7EF, United Kingdom; tim.kietzmann@mrc-cbu.cam.ac.uk.
² Donders Institute for Brain, Cognition and Behaviour, Radboud University, 6525 HR Nijmegen, The Netherlands.
³ MRC Cognition and Brain Sciences Unit, University of Cambridge, Cambridge CB2 7EF, United Kingdom.
⁴ Department of Psychology, University of Amsterdam, 1018 WD Amsterdam, The Netherlands.
⁵ Department of Education and Psychology, Freie Universität Berlin, 14195 Berlin, Germany.
⁶ Department of Psychology, Columbia University, New York, NY 10027.

Abstract

The human visual system is an intricate network of brain regions that enables us to recognize the world around us. Despite its abundant lateral and feedback connections, object processing is commonly viewed and studied as a feedforward process. Here, we measure and model the rapid representational dynamics across multiple stages of the human ventral stream using time-resolved brain imaging and deep learning. We observe substantial representational transformations during the first 300 ms of processing within and across ventral-stream regions. Categorical divisions emerge in sequence, cascading forward and in reverse across regions, and Granger causality analysis suggests bidirectional information flow between regions. Finally, recurrent deep neural network models clearly outperform parameter-matched feedforward models in terms of their ability to capture the multiregion cortical dynamics. Targeted virtual cooling experiments on the recurrent deep network models further substantiate the importance of their lateral and top-down connections. These results establish that recurrent models are required to understand information processing in the human ventral stream.

Keywords: deep recurrent neural networks; magnetoencephalography; object recognition; representational dynamics; virtual cooling.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Adult
Deep Learning
Feedback, Sensory
Female
Humans
Magnetoencephalography
Models, Neurological*
Nerve Net
Visual Pathways
Visual Perception / physiology*

Abstract

Publication types

MeSH terms

Grants and funding