Vulnerability of classifiers to evolutionary generated adversarial examples

Petra Vidnerová; Roman Neruda

doi:10.1016/j.neunet.2020.04.015

Vulnerability of classifiers to evolutionary generated adversarial examples

Neural Netw. 2020 Jul:127:168-181. doi: 10.1016/j.neunet.2020.04.015. Epub 2020 Apr 20.

Authors

Petra Vidnerová¹, Roman Neruda²

Affiliations

¹ The Czech Academy of Sciences, Institute of Computer Science, Pod Vodárenskou věží 271/2, 182 07 Prague 8, Czechia. Electronic address: petra@cs.cas.cz.
² The Czech Academy of Sciences, Institute of Computer Science, Pod Vodárenskou věží 271/2, 182 07 Prague 8, Czechia. Electronic address: roman@cs.cas.cz.

PMID: 32361547
DOI: 10.1016/j.neunet.2020.04.015

Abstract

This paper deals with the vulnerability of machine learning models to adversarial examples and its implication for robustness and generalization properties. We propose an evolutionary algorithm that can generate adversarial examples for any machine learning model in the black-box attack scenario. This way, we can find adversarial examples without access to model's parameters, only by querying the model at hand. We have tested a range of machine learning models including deep and shallow neural networks. Our experiments have shown that the vulnerability to adversarial examples is not only the problem of deep networks, but it spreads through various machine learning architectures. Rather, it depends on the type of computational units. Local units, such as Gaussian kernels, are less vulnerable to adversarial examples.

Keywords: Adversarial examples; Genetic algorithms; Kernel methods; Neural networks; Supervised learning.

MeSH terms

Algorithms
Humans
Machine Learning / trends
Neural Networks, Computer*
Pattern Recognition, Automated / methods*
Pattern Recognition, Automated / trends
Supervised Machine Learning* / trends