Data Augmentation Effects on Highly Imbalanced EEG Datasets for Automatic Detection of Photoparoxysmal Responses

Sensors (Basel). 2023 Feb 19;23(4):2312. doi: 10.3390/s23042312.

Abstract

Photosensitivity is a neurological disorder in which a person's brain produces epileptic discharges, known as Photoparoxysmal Responses (PPRs), when it receives certain visual stimuli. The current standardized diagnosis process used in hospitals consists of submitting the subject to the Intermittent Photic Stimulation process and attempting to trigger these phenomena. The brain activity is measured by an Electroencephalogram (EEG), and the clinical specialists manually look for the PPRs that were provoked during the session. Due to the nature of this disorder, long EEG recordings may contain very few PPR segments, meaning that a highly imbalanced dataset is available. To tackle this problem, this research focused on applying Data Augmentation (DA) to create synthetic PPR segments from the real ones, improving the balance of the dataset and, thus, the global performance of the Machine Learning techniques applied for automatic PPR detection. K-Nearest Neighbors and a One-Hidden-Dense-Layer Neural Network were employed to evaluate the performance of this DA stage. The results showed that DA is able to improve the models, making them more robust and more able to generalize. A comparison with the results obtained from a previous experiment also showed a performance improvement of around 20% for the Accuracy and Specificity measurements without Sensitivity suffering any losses. This project is currently being carried out with subjects at Burgos University Hospital, Spain.

Keywords: Data Augmentation; EEG; Machine Learning; PPR; Photoparoxysmal Response; electroencephalography; epilepsy; photosensitivity.

MeSH terms

  • Body Fluids*
  • Brain
  • Cluster Analysis
  • Electroencephalography*
  • Hospitals
  • Humans

Grants and funding

This research was funded by the Spanish Ministry of Economics and Industry, Grant PID2020-112726RB-I00, by the Spanish Research Agency (AEI, Spain) under Grant agreement RED2018-102312-T (IA-Biomed), and by the Ministry of Science and Innovation under CERVERA Excellence Network project CER-20211003 (IBERUS) and Missions Science and Innovation project MIG-20211008 (INMERBOT), as well as by Principado de Asturias, Grant SV-PA-21-AYUD/2021/50994.