ReinforSec: An Automatic Generator of Synthetic Malware Samples and Denial-of-Service Attacks through Reinforcement Learning

Aldo Hernandez-Suarez; Gabriel Sanchez-Perez; Linda K Toscano-Medina; Hector Perez-Meana; Jesus Olivares-Mercado; Jose Portillo-Portillo; Gibran Benitez-Garcia; Ana Lucila Sandoval Orozco; Luis Javier García Villalba

doi:10.3390/s23031231

Sensors (Basel). 2023 Jan 20;23(3):1231. doi: 10.3390/s23031231.

Authors

Aldo Hernandez-Suarez¹, Gabriel Sanchez-Perez¹, Linda K Toscano-Medina¹, Hector Perez-Meana¹, Jesus Olivares-Mercado¹, Jose Portillo-Portillo¹, Gibran Benitez-Garcia², Ana Lucila Sandoval Orozco³, Luis Javier García Villalba³

Affiliations

¹ Instituto Politecnico Nacional, ESIME Culhuacan, Mexico City 04440, Mexico.
² Graduate School of Informatics and Engineering, The University of Electro-Communications, Tokyo 182-8585, Japan.
³ Group of Analysis, Security and Systems (GASS), Department of Software Engineering and Artificial Intelligence (DISIA), Faculty of Computer Science and Engineering, Office 431, Universidad Complutense de Madrid (UCM), 28040 Madrid, Spain.

Abstract

In recent years, cybersecurity has been strengthened through the adoption of processes, mechanisms and rapid sources of indicators of compromise in critical areas. Among the most pressing challenges are the detection, classification and eradication of malware and Denial-of-Service (DoS) cyber-attacks. The literature has presented different ways to obtain and evaluate malware and DoS cyber-attack instances, either from a technical point of view or by offering ready-to-use datasets. However, acquiring fresh, up-to-date samples requires an arduous process of exploration, sandbox configuration and mass storage, which may ultimately result in an unbalanced or under-represented set. Synthetic sample generation has been shown to reduce the cost of setting up controlled environments and the time spent on sample evaluation. Nevertheless, the process is performed on observations that already belong to a characterized set, totally detached from a real environment. To address this limitation, this work proposes a methodology for generating synthetic samples of malicious Portable Executable binaries and DoS cyber-attacks. The task is performed by a Reinforcement Learning engine, which learns from a baseline of different malware families and DoS cyber-attack network properties and produces new, mutated and highly functional samples. Experimental results demonstrate the high adaptability of the outputs as new input datasets for different Machine Learning algorithms.

Keywords: artificial intelligence; cybersecurity; cybersecurity datasets; denial-of-service; machine learning; malware; Q-learning; reinforcement learning; synthetic sampling.

Grants and funding

This research received no external funding.