Boosting adversarial robustness via self-paced adversarial training

Neural Netw. 2023 Oct:167:706-714. doi: 10.1016/j.neunet.2023.08.063. Epub 2023 Sep 9.

Abstract

Adversarial training is considered one of the most effective methods for improving the adversarial robustness of deep neural networks. Despite its success, it still suffers from unsatisfactory performance and overfitting. Considering the intrinsic mechanism of adversarial training, recent studies adopt the idea of curriculum learning to alleviate overfitting. However, this introduces new issues, namely the lack of a quantitative criterion for attack strength and catastrophic forgetting. To mitigate these issues, we propose self-paced adversarial training (SPAT), which explicitly builds the learning process of adversarial training on adversarial examples from the whole dataset. Specifically, our model is first trained with "easy" adversarial examples and is then continuously enhanced by gradually adding "complex" adversarial examples. This strengthens the ability to fit "complex" adversarial examples while retaining what was learned from "easy" ones. To balance adversarial examples across classes, we determine the difficulty of adversarial examples locally within each class. Notably, this learning paradigm can also be incorporated into other advanced methods to further boost adversarial robustness. Experimental results show the effectiveness of the proposed model against various attacks on widely used benchmarks. In particular, on CIFAR-100, SPAT boosts AWP by 1.7% (relatively 5.4%) in robust accuracy under the PGD10 attack and by 3.9% (relatively 7.2%) in natural accuracy.
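The per-class selection and pacing described above can be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes the per-example adversarial loss serves as the difficulty score (the abstract does not specify the criterion), and all function names (`select_self_paced`, `pace_at`) are ours.

```python
from collections import defaultdict

def select_self_paced(losses, labels, pace):
    """Return indices of the easiest `pace` fraction of examples,
    chosen separately within each class so classes stay balanced.

    losses : per-example adversarial losses (higher = harder); an assumed proxy
    labels : class labels, same length as `losses`
    pace   : fraction in (0, 1] of each class to include this epoch
    """
    by_class = defaultdict(list)
    for idx, (loss, lab) in enumerate(zip(losses, labels)):
        by_class[lab].append((loss, idx))
    selected = []
    for items in by_class.values():
        items.sort()  # easiest (lowest loss) first
        k = max(1, int(round(pace * len(items))))
        selected.extend(idx for _, idx in items[:k])
    return sorted(selected)

def pace_at(epoch, total_epochs, start=0.3):
    """Hypothetical linear pacing schedule: begin with the easiest
    `start` fraction and grow to the full set, so "complex"
    adversarial examples are added gradually over training."""
    return min(1.0, start + (1.0 - start) * epoch / max(1, total_epochs - 1))
```

For example, with `pace=1/3` the easiest third of each class is kept, so a class with uniformly hard adversarial examples is still represented; earlier selections remain in later epochs, which is how the curriculum avoids forgetting "easy" examples.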

Keywords: Adversarial robustness; Adversarial training; Self-paced learning.

MeSH terms

  • Benchmarking*
  • Learning*
  • Neural Networks, Computer