Bridged adversarial training

Hoki Kim; Woojin Lee; Sungyoon Lee; Jaewook Lee

doi:10.1016/j.neunet.2023.08.024

Neural Netw. 2023 Oct:167:266-282. doi: 10.1016/j.neunet.2023.08.024. Epub 2023 Aug 22.

Authors

Hoki Kim¹, Woojin Lee², Sungyoon Lee³, Jaewook Lee⁴

Affiliations

¹ Institute of Engineering Research, Seoul National University, Gwanak-gu 08826, Republic of Korea.
² School of AI Convergence, Dongguk University-Seoul, Jung-gu 04620, Republic of Korea.
³ Department of Computer Science, Hanyang University, Seongdong-gu 04763, Republic of Korea.
⁴ Department of Industrial Engineering, Seoul National University, Gwanak-gu 08826, Republic of Korea. Electronic address: jaewook@snu.ac.kr.

PMID: 37666185

Abstract

Adversarial robustness is considered a required property of deep neural networks. In this study, we discover that adversarially trained models might have significantly different characteristics in terms of margin and smoothness, even though they show similar robustness. Inspired by this observation, we investigate the effect of different regularizers and discover the negative effect of the smoothness regularizer on maximizing the margin. Based on these analyses, we propose a new method called bridged adversarial training that mitigates the negative effect by bridging the gap between clean and adversarial examples. We provide theoretical and empirical evidence that the proposed method provides stable and better robustness, especially for large perturbations.

Keywords: Adversarial defense; Adversarial robustness; Adversarial training; Neural networks.

MeSH terms

Neural Networks, Computer*