Perturbing BEAMs: EEG adversarial attack to deep learning models for epilepsy diagnosing

Jianfeng Yu; Kai Qiu; Pengju Wang; Caixia Su; Yufeng Fan; Yongfeng Cao

doi:10.1186/s12911-023-02212-5

BMC Med Inform Decis Mak. 2023 Jul 6;23(1):115. doi: 10.1186/s12911-023-02212-5.

Authors

Jianfeng Yu^#¹, Kai Qiu^#¹, Pengju Wang¹, Caixia Su¹, Yufeng Fan¹, Yongfeng Cao²

Affiliations

¹ School of Big Data and Computer Science, Guizhou Normal University, Guiyang, 550025, China.
² School of Big Data and Computer Science, Guizhou Normal University, Guiyang, 550025, China. cyfeis@gznu.edu.cn.

^# Contributed equally.

Abstract

Deep learning models are widely used in electroencephalogram (EEG) analysis and achieve excellent performance, but adversarial attacks against them, and the corresponding defenses, should be thoroughly studied before the models are put into safety-sensitive use. This work exposes an important safety issue in deep-learning-based brain disease diagnostic systems by examining how vulnerable deep learning models that diagnose epilepsy from brain electrical activity mappings (BEAMs) are to white-box attacks. It proposes two methods, Gradient Perturbations of BEAMs (GPBEAM) and Gradient Perturbations of BEAMs with Differential Evolution (GPBEAM-DE), which are the first to generate EEG adversarial samples by perturbing BEAMs, densely and sparsely respectively, and finds that these BEAM-based adversarial samples can easily mislead deep learning models. The experiments use EEG data from the CHB-MIT dataset and two types of victim models, each with four different deep neural network (DNN) architectures. The results show that: (1) the BEAM-based adversarial samples produced by the proposed methods are effective against BEAM-related victim models, whose internal DNNs take BEAMs as input, but ineffective against EEG-related victim models, whose internal DNNs take raw EEG as input, with a top attack success rate of 0.8 against BEAM-related models but only 0.01 against EEG-related models; (2) GPBEAM-DE outperforms GPBEAM when both attack the same victim model under the same distortion constraint, with top attack success rates of 0.8 and 0.59 respectively; (3) a simple modification to GPBEAM/GPBEAM-DE makes it effective against both BEAM-related and EEG-related models (top attack success rates of 0.8 and 0.64), without any increase in distortion. The goal of this study is not to attack any EEG medical diagnostic system, but to raise concerns about the safety of deep learning models in the hope of encouraging safer designs.
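
The abstract does not give the update rule used by GPBEAM, so the following is not the authors' method. As a rough illustration of what a dense, gradient-based white-box perturbation and an "attack success rate" mean in general, here is a minimal sketch that applies a generic gradient-sign step to a toy logistic-regression model. The model, the flattened 4-value "map" inputs, the function names, and the convention of counting success only over samples the model originally classifies correctly are all assumptions made for this example.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def predict(weights, bias, x):
    """Probability of class 1 under a toy logistic-regression 'victim' model."""
    return sigmoid(sum(w * v for w, v in zip(weights, x)) + bias)


def gradient_sign_perturb(weights, bias, x, label, epsilon):
    """One dense gradient-sign step under an L-infinity budget `epsilon`.

    For logistic regression with cross-entropy loss, the gradient of the
    loss with respect to input x_i is (p - label) * w_i. Every input value
    is moved by at most `epsilon` in the direction that increases the loss.
    """
    p = predict(weights, bias, x)
    grad = [(p - label) * w for w in weights]
    # Sign of each gradient entry as -1, 0, or +1.
    signs = [(g > 0) - (g < 0) for g in grad]
    return [v + epsilon * s for v, s in zip(x, signs)]


def attack_success_rate(weights, bias, samples, labels, epsilon):
    """Fraction of correctly classified samples whose prediction flips.

    Assumed convention: samples the model already misclassifies are skipped.
    """
    attacked = 0
    flipped = 0
    for x, y in zip(samples, labels):
        if (predict(weights, bias, x) >= 0.5) != bool(y):
            continue  # model is already wrong on this sample
        attacked += 1
        x_adv = gradient_sign_perturb(weights, bias, x, y, epsilon)
        if (predict(weights, bias, x_adv) >= 0.5) != bool(y):
            flipped += 1
    return flipped / attacked if attacked else 0.0


if __name__ == "__main__":
    # Hypothetical weights and inputs, chosen only to exercise the functions.
    w = [1.0, -1.0, 0.5, 0.0]
    samples = [[0.2, 0.1, 0.0, 0.3], [2.0, 0.0, 0.0, 0.0]]
    labels = [1, 1]
    print(attack_success_rate(w, 0.0, samples, labels, epsilon=0.1))
```

In this toy setting the first sample sits close to the decision boundary and is flipped by the perturbation, while the second is far from it and survives the same budget. A sparse attack such as the one the abstract attributes to GPBEAM-DE would instead change only a few input values; that variant is not sketched here.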

Keywords: Adversarial attack; BEAMs; Deep learning model; EEG; Epilepsy; Sparse attack.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Brain
Brain Mapping
Deep Learning*
Electroencephalography
Epilepsy* / diagnosis
Humans

Grants and funding

GZKJ[2017]1128/Guizhou Provincial Science and Technology Foundation