Perturbation diversity certificates robust generalization

Zhuang Qian; Shufei Zhang; Kaizhu Huang; Qiufeng Wang; Xinping Yi; Bin Gu; Huan Xiong

doi:10.1016/j.neunet.2024.106117

Perturbation diversity certificates robust generalization

Neural Netw. 2024 Apr:172:106117. doi: 10.1016/j.neunet.2024.106117. Epub 2024 Jan 8.

Authors

Zhuang Qian¹, Shufei Zhang², Kaizhu Huang³, Qiufeng Wang⁴, Xinping Yi⁵, Bin Gu⁶, Huan Xiong⁷

Affiliations

¹ Department of Electrical Engineering and Electronics, University of Liverpool, United Kingdom; School of Advanced Technology, Xi'an Jiaotong-Liverpool University, China.
² Shanghai Artificial Intelligence Laboratory, China.
³ Data Science Research Center, Duke Kunshan University, China. Electronic address: kaizhu.huang@dukekunshan.edu.cn.
⁴ School of Advanced Technology, Xi'an Jiaotong-Liverpool University, China. Electronic address: qiufeng.wang@xjtlu.edu.cn.
⁵ Department of Electrical Engineering and Electronics, University of Liverpool, United Kingdom.
⁶ Mohamed bin Zayed University of Artificial Intelligence, United Arab Emirates.
⁷ Mohamed bin Zayed University of Artificial Intelligence, United Arab Emirates; Institute for Advanced Study in Mathematics, Harbin Institute of Technology, China.

PMID: 38232423
DOI: 10.1016/j.neunet.2024.106117

Abstract

Whilst adversarial training has been proven to be one most effective defending method against adversarial attacks for deep neural networks, it suffers from over-fitting on training adversarial data and thus may not guarantee the robust generalization. This may result from the fact that the conventional adversarial training methods generate adversarial perturbations usually in a supervised way so that the resulting adversarial examples are highly biased towards the decision boundary, leading to an inhomogeneous data distribution. To mitigate this limitation, we propose to generate adversarial examples from a perturbation diversity perspective. Specifically, the generated perturbed samples are not only adversarial but also diverse so as to certify robust generalization and significant robustness improvement through a homogeneous data distribution. We provide theoretical and empirical analysis, establishing a foundation to support the proposed method. As a major contribution, we prove that promoting perturbations diversity can lead to a better robust generalization bound. To verify our methods' effectiveness, we conduct extensive experiments over different datasets (e.g., CIFAR-10, CIFAR-100, SVHN) with different adversarial attacks (e.g., PGD, CW). Experimental results show that our method outperforms other state-of-the-art (e.g., PGD and Feature Scattering) in robust generalization performance.

Keywords: Adversarial examples; Adversarial robustness; Robust generalization.

MeSH terms

Generalization, Psychological*
Neural Networks, Computer*