Vulnerability of classifiers to evolutionary generated adversarial examples

Neural Netw. 2020 Jul:127:168-181. doi: 10.1016/j.neunet.2020.04.015. Epub 2020 Apr 20.

Abstract

This paper deals with the vulnerability of machine learning models to adversarial examples and its implications for robustness and generalization properties. We propose an evolutionary algorithm that can generate adversarial examples for any machine learning model in the black-box attack scenario. This way, we can find adversarial examples without access to the model's parameters, only by querying the model at hand. We have tested a range of machine learning models, including deep and shallow neural networks. Our experiments have shown that vulnerability to adversarial examples is not a problem of deep networks alone but extends across various machine learning architectures. Rather than on depth, it depends on the type of computational units: local units, such as Gaussian kernels, are less vulnerable to adversarial examples.
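The black-box setting described above can be illustrated with a minimal sketch of a query-only evolutionary search. This is not the authors' algorithm: the function name `evolve_adversarial`, the fitness (the model's confidence in the true class), the truncation selection, the uniform crossover, the Gaussian mutation, the `eps` perturbation bound, the assumed [0, 1] feature range, and the logistic `toy_model` are all assumptions chosen for illustration. The only property taken from the abstract is that the search touches the model solely through its outputs.

```python
import math
import random


def evolve_adversarial(query, x, true_label, eps=0.3, pop_size=30,
                       generations=100, sigma=0.05, seed=0):
    """Search for an input near x that the black-box `query` misclassifies.

    `query(v)` returns a list of class probabilities; no parameters or
    gradients of the model are used. All hyperparameters are illustrative.
    """
    rng = random.Random(seed)

    def clip(v):
        # Stay within eps of the original input and inside the assumed [0, 1] range.
        return [min(max(vi, max(xi - eps, 0.0)), min(xi + eps, 1.0))
                for vi, xi in zip(v, x)]

    def mutate(v):
        # Gaussian mutation applied independently to every feature.
        return clip([vi + rng.gauss(0.0, sigma) for vi in v])

    def predicted(v):
        probs = query(v)
        return max(range(len(probs)), key=probs.__getitem__)

    population = [mutate(x) for _ in range(pop_size)]
    for _ in range(generations):
        # Fitness to minimize: the model's confidence in the true class.
        population.sort(key=lambda v: query(v)[true_label])
        for v in population:
            if predicted(v) != true_label:
                return v  # adversarial example found
        # Truncation selection: the better half survives unchanged.
        parents = population[:pop_size // 2]
        children = []
        while len(parents) + len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            # Uniform crossover followed by mutation.
            child = [ai if rng.random() < 0.5 else bi for ai, bi in zip(a, b)]
            children.append(mutate(child))
        population = parents + children
    return None  # no adversarial example found within the budget


def toy_model(v):
    # Hypothetical stand-in for the attacked model: a fixed logistic classifier.
    z = sum(2.0 * vi for vi in v) - 4.0
    p1 = 1.0 / (1.0 + math.exp(-z))
    return [1.0 - p1, p1]


x0 = [0.6, 0.6, 0.6, 0.6]  # classified as class 1 by toy_model
adv = evolve_adversarial(toy_model, x0, true_label=1)
```

Because the search relies only on `query`, the same loop could in principle be pointed at any classifier that exposes class probabilities, which is the sense in which such an attack is model-agnostic.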

Keywords: Adversarial examples; Genetic algorithms; Kernel methods; Neural networks; Supervised learning.

MeSH terms

  • Algorithms
  • Humans
  • Machine Learning / trends
  • Neural Networks, Computer*
  • Pattern Recognition, Automated / methods*
  • Pattern Recognition, Automated / trends
  • Supervised Machine Learning* / trends