Seizure onset zone classification based on imbalanced iEEG with data augmentation

J Neural Eng. 2022 Nov 18;19(6). doi: 10.1088/1741-2552/aca04f.

Abstract

Objective. Identifying the seizure onset zone (SOZ) in patients with focal epilepsy is the critical information required for surgery. However, collecting this information is challenging, time-consuming, and subjective. Some machine learning methods reduce the workload of clinical experts in intracranial electroencephalogram (iEEG) visual diagnosis but face significant challenges because interictal iEEG clinical data often suffer from a significant class imbalance. We aim to generate synthetic data for the minority class.Approach. To make the clinically imbalanced data suitable for machine learning, we introduce an EEG augmentation method (EEGAug). The EEGAug method randomly selects several samples from the minority class and transforms them into the frequency domain. Then, different frequency bands from different samples are used to compose new data. Finally, a synthetic sample is generated after converting the new data back to the time domain.Main results. The imbalanced clinical iEEG data can be balanced and applied to machine learning models using the method. A one-dimensional convolutional neural network model is used to classify the SOZ and non-SOZ data. We compare the EEGAug method with other data augmentation methods and another method of class-balanced focal loss function, which is also used for solving the data imbalance problem by adjusting the weights between the minority and majority classes. The results show that the EEGAug method performs best in most data.Significance. Data imbalance is a widespread clinical problem. The EEGAug method can flexibly generate synthetic data for the minority class, yielding synthetic and raw data with a high distribution similarity. By using the EEGAug method, clinical data can be used in machine learning models.

Keywords: ECoG; epilepsy; machine learning; seizure onset.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Electroencephalography* / methods
  • Humans
  • Machine Learning
  • Seizures* / diagnosis