A Distributed Black-Box Adversarial Attack Based on Multi-Group Particle Swarm Optimization

Naufal Suryanto; Hyoeun Kang; Yongsu Kim; Youngyeo Yun; Harashta Tatimma Larasati; Howon Kim

doi:10.3390/s20247158

A Distributed Black-Box Adversarial Attack Based on Multi-Group Particle Swarm Optimization

Sensors (Basel). 2020 Dec 14;20(24):7158. doi: 10.3390/s20247158.

Authors

Naufal Suryanto¹, Hyoeun Kang¹, Yongsu Kim¹, Youngyeo Yun¹, Harashta Tatimma Larasati¹, Howon Kim¹

Affiliation

¹ School of Computer Science and Engineering, Pusan National University, Busan 609735, Korea.

Abstract

Adversarial attack techniques in deep learning have been studied extensively due to its stealthiness to human eyes and potentially dangerous consequences when applied to real-life applications. However, current attack methods in black-box settings mainly employ a large number of queries for crafting their adversarial examples, hence making them very likely to be detected and responded by the target system (e.g., artificial intelligence (AI) service provider) due to its high traffic volume. A recent proposal able to address the large query problem utilizes a gradient-free approach based on Particle Swarm Optimization (PSO) algorithm. Unfortunately, this original approach tends to have a low attack success rate, possibly due to the model's difficulty of escaping local optima. This obstacle can be overcome by employing a multi-group approach for PSO algorithm, by which the PSO particles can be redistributed, preventing them from being trapped in local optima. In this paper, we present a black-box adversarial attack which can significantly increase the success rate of PSO-based attack while maintaining a low number of query by launching the attack in a distributed manner. Attacks are executed from multiple nodes, disseminating queries among the nodes, hence reducing the possibility of being recognized by the target system while also increasing scalability. Furthermore, we utilize Multi-Group PSO with Random Redistribution (MGRR-PSO) for perturbation generation, performing better than the original approach against local optima, thus achieving a higher success rate. Additionally, we propose to efficiently remove excessive perturbation (i.e, perturbation pruning) by utilizing again the MGRR-PSO rather than a standard iterative method as used in the original approach. We perform five different experiments: comparing our attack's performance with existing algorithms, testing in high-dimensional space in ImageNet dataset, examining our hyperparameters (i.e., particle size, number of clients, search boundary), and testing on real digital attack to Google Cloud Vision. Our attack proves to obtain a 100% success rate on MNIST and CIFAR-10 datasets and able to successfully fool Google Cloud Vision as a proof of the real digital attack by maintaining a lower query and wide applicability.

Keywords: adversarial examples; distributed attack; particle swarm optimization.

MeSH terms

Algorithms*
Artificial Intelligence*
Humans

Abstract

MeSH terms

Grants and funding