STAnet: A Spatiotemporal Attention Network for Decoding Auditory Spatial Attention From EEG

IEEE Trans Biomed Eng. 2022 Jul;69(7):2233-2242. doi: 10.1109/TBME.2022.3140246. Epub 2022 Jun 17.

Abstract

Objective: Humans are able to localize the source of a sound. This enables them to direct attention to a particular speaker in a cocktail party. Psycho-acoustic studies show that the sensory cortices of the human brain respond to the location of sound sources differently, and the auditory attention itself is a dynamic and temporally based brain activity. In this work, we seek to build a computational model which uses both spatial and temporal information manifested in EEG signals for auditory spatial attention detection (ASAD).

Methods: We propose an end-to-end spatiotemporal attention network, denoted as STAnet, to detect auditory spatial attention from EEG. The STAnet is designed to assign differentiated weights dynamically to EEG channels through a spatial attention mechanism, and to temporal patterns in EEG signals through a temporal attention mechanism.

Results: We report the ASAD experiments on two publicly available datasets. The STAnet outperforms other competitive models by a large margin under various experimental conditions. Its attention decision for 1-second decision window outperforms that of the state-of-the-art techniques for 10-second decision window. Experimental results also demonstrate that the STAnet achieves competitive performance on EEG signals ranging from 64 to as few as 16 channels.

Conclusion: This study provides evidence suggesting that efficient low-density EEG online decoding is within reach.

Significance: This study also marks an important step towards the practical implementation of ASAD in real life applications.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Acoustics
  • Brain*
  • Electroencephalography* / methods
  • Head
  • Humans
  • Sound