A neural mechanism for contextualizing fragmented inputs during naturalistic vision

Elife. 2019 Oct 9:8:e48182. doi: 10.7554/eLife.48182.

Abstract

With every glimpse of our eyes, we sample only a small and incomplete fragment of the visual world, which needs to be contextualized and integrated into a coherent scene representation. Here we show that the visual system achieves this contextualization by exploiting spatial schemata, that is our knowledge about the composition of natural scenes. We measured fMRI and EEG responses to incomplete scene fragments and used representational similarity analysis to reconstruct their cortical representations in space and time. We observed a sorting of representations according to the fragments' place within the scene schema, which occurred during perceptual analysis in the occipital place area and within the first 200 ms of vision. This schema-based coding operates flexibly across visual features (as measured by a deep neural network model) and different types of environments (indoor and outdoor scenes). This flexibility highlights the mechanism's ability to efficiently organize incoming information under dynamic real-world conditions.

Keywords: deep neural network models; fMRI/EEG; human; multivariate pattern analysis; neuroscience; real-world structure; scene representation; visual perception.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adult
  • Cognition*
  • Electroencephalography
  • Female
  • Humans
  • Magnetic Resonance Imaging
  • Male
  • Models, Neurological
  • Visual Perception*
  • Young Adult