Decoding Brain Representations by Multimodal Learning of Neural Activity and Visual Features

IEEE Trans Pattern Anal Mach Intell. 2021 Nov;43(11):3833-3849. doi: 10.1109/TPAMI.2020.2995909. Epub 2021 Oct 1.

Abstract

This work presents a novel method for exploring human brain-visual representations, with a view towards replicating these processes in machines. The core idea is to learn plausible computational and biological representations by correlating human neural activity with natural images. To this end, we first propose a model, EEG-ChannelNet, that learns a brain manifold for EEG classification. After verifying that visual information can be extracted from EEG data, we introduce a multimodal approach that uses deep image and EEG encoders, trained in a siamese configuration, to learn a joint manifold that maximizes a compatibility measure between visual features and brain representations. We then perform image classification and saliency detection on the learned manifold. Performance analyses show that our approach satisfactorily decodes visual information from neural signals. This, in turn, can be used to effectively supervise the training of deep learning models, as demonstrated by the strong image-classification and saliency-detection performance achieved on out-of-training classes. The results show that the learned brain-visual features improve performance while bringing deep models more in line with cognitive neuroscience work on visual perception and attention.
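As a rough illustration of the siamese joint-embedding scheme the abstract describes (not the authors' released code or their EEG-ChannelNet architecture), the Python sketch below pairs a toy image encoder with a toy EEG encoder, projects both into a shared space, and trains them with a hinge loss over a dot-product compatibility score so that matching image/EEG pairs outscore mismatched ones. The encoder architectures, input sizes (128 EEG channels x 440 samples), margin, and learning rate are all illustrative assumptions.

import torch
import torch.nn as nn
import torch.nn.functional as F

class ImageEncoder(nn.Module):
    """Toy CNN mapping an RGB image to an L2-normalized joint embedding."""
    def __init__(self, embed_dim=128):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(3, 32, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(32, 64, 3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1),
        )
        self.proj = nn.Linear(64, embed_dim)

    def forward(self, x):
        return F.normalize(self.proj(self.features(x).flatten(1)), dim=-1)

class EEGEncoder(nn.Module):
    """Toy temporal-conv encoder for multichannel EEG (channels x time)."""
    def __init__(self, n_channels=128, embed_dim=128):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv1d(n_channels, 64, kernel_size=15, stride=2), nn.ReLU(),
            nn.Conv1d(64, 64, kernel_size=15, stride=2), nn.ReLU(),
            nn.AdaptiveAvgPool1d(1),
        )
        self.proj = nn.Linear(64, embed_dim)

    def forward(self, x):
        return F.normalize(self.proj(self.features(x).flatten(1)), dim=-1)

def compatibility_hinge_loss(img_emb, eeg_emb, margin=0.2):
    """Hinge loss over the batch compatibility matrix: the diagonal holds
    matching (image, EEG) pairs; every off-diagonal entry is a negative."""
    scores = img_emb @ eeg_emb.t()                 # [B, B] dot-product scores
    pos = scores.diag().unsqueeze(1)               # score of each true pair
    hinge = (margin + scores - pos).clamp(min=0)   # violated margins only
    off_diag = ~torch.eye(scores.size(0), dtype=torch.bool, device=scores.device)
    return hinge[off_diag].mean()

# Usage on random tensors, just to show shapes and one optimization step.
img_enc, eeg_enc = ImageEncoder(), EEGEncoder()
opt = torch.optim.Adam(
    list(img_enc.parameters()) + list(eeg_enc.parameters()), lr=1e-4)
images = torch.randn(8, 3, 224, 224)    # batch of natural images
eeg = torch.randn(8, 128, 440)          # batch of EEG epochs
opt.zero_grad()
loss = compatibility_hinge_loss(img_enc(images), eeg_enc(eeg))
loss.backward()
opt.step()

Once trained this way, classification or saliency analysis on out-of-training classes would operate in the shared embedding space (e.g., by comparing an image embedding against EEG-derived class anchors), which is the sense in which the brain representations supervise the visual model.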

MeSH terms

  • Algorithms*
  • Attention
  • Brain / diagnostic imaging
  • Humans
  • Neural Networks, Computer*
  • Visual Perception