https://github.com/cgcl-codes/transferattacksurrogates

The official code of IEEE S&P 2024 paper "Why Does Little Robustness Help? A Further Step Towards Understanding Adversarial Transferability". We study how to train surrogate models that boost transfer attacks.

Science Score: 13.0%

This score indicates how likely this project is to be science-related, based on the following indicators:

  • CITATION.cff file
  • codemeta.json file (found)
  • .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity (low: 7.3%)

Keywords

adversarial-attacks adversarial-training black-box-attack data-augmentation distribution-shift gradient-regularization sharpness-aware-minimization transfer-attack
Last synced: 5 months ago

Repository

The official code of IEEE S&P 2024 paper "Why Does Little Robustness Help? A Further Step Towards Understanding Adversarial Transferability". We study how to train surrogate models that boost transfer attacks.

Basic Info
Statistics
  • Stars: 14
  • Watchers: 4
  • Forks: 3
  • Open Issues: 1
  • Releases: 0
Topics
adversarial-attacks adversarial-training black-box-attack data-augmentation distribution-shift gradient-regularization sharpness-aware-minimization transfer-attack
Created over 2 years ago · Last pushed over 1 year ago
Metadata Files
Readme · License

README.md

TransferAttackSurrogates

The implementation of our IEEE S&P 2024 paper "Why Does Little Robustness Help? A Further Step Towards Understanding Adversarial Transferability"

Abstract

Adversarial examples for deep neural networks (DNNs) have been shown to be transferable: examples that successfully fool one white-box surrogate model can also deceive other black-box models with different architectures. Although a number of empirical studies have provided guidance on generating highly transferable adversarial examples, many of these findings are not well explained, and some even lead to confusing or inconsistent advice for practical use.

In this paper, we take a further step towards understanding adversarial transferability, with a particular focus on surrogate aspects. Starting from the intriguing "little robustness" phenomenon, where models adversarially trained with mildly perturbed adversarial samples can serve as better surrogates for transfer attacks, we attribute it to a trade-off between two dominant factors: model smoothness and gradient similarity. Our research focuses on their joint effects on transferability, rather than studying each relationship in isolation. Through a combination of theoretical and empirical analyses, we hypothesize that the data distribution shift induced by off-manifold samples in adversarial training is what impairs gradient similarity.

Building on these insights, we further explore the impacts of prevalent data augmentation and gradient regularization on transferability and analyze how the trade-off manifests in various training methods, thus building a comprehensive blueprint for the regulation mechanisms behind transferability. Finally, we provide a general route for constructing superior surrogates to boost transferability, which optimizes both model smoothness and gradient similarity simultaneously, e.g., the combination of input gradient regularization and sharpness-aware minimization (SAM), validated by extensive experiments. In summary, we call for attention to the united impacts of these two factors for launching effective transfer attacks, rather than optimizing one while ignoring the other, and emphasize the crucial role of manipulating surrogate models.

Model Training

All the training methods reported in our paper are implemented in train.py under the CIFAR_Train directory.

SAM

```
python train.py --arch resnet18 \
    --dataset cifar10 \
    --sam \
    --rho 0.1 \
    --save-dir ./cifar10-models/resnet18-sam-0.1 \
    --epoch 200
```
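For readers unfamiliar with SAM, the sketch below illustrates the perturb-then-descend update that the --sam and --rho flags enable. It is a minimal, self-contained PyTorch illustration of the technique, not the code in train.py:

```python
import torch
import torch.nn.functional as F

def sam_step(model, compute_loss, base_optimizer, rho=0.1):
    # 1) Gradient at the current weights w.
    compute_loss().backward()
    params = [p for p in model.parameters() if p.grad is not None]
    with torch.no_grad():
        # 2) Climb to the approximate worst case: w + rho * g / ||g||_2.
        grad_norm = torch.norm(torch.stack([p.grad.norm(p=2) for p in params]))
        eps = [p.grad * (rho / (grad_norm + 1e-12)) for p in params]
        for p, e in zip(params, eps):
            p.add_(e)
    model.zero_grad()
    # 3) Gradient at the perturbed weights.
    compute_loss().backward()
    with torch.no_grad():
        # 4) Restore w, then descend using the perturbed gradient.
        for p, e in zip(params, eps):
            p.sub_(e)
    base_optimizer.step()
    model.zero_grad()

# Usage in a training loop, replacing loss.backward(); optimizer.step():
#   sam_step(model, lambda: F.cross_entropy(model(x), y), optimizer, rho=0.1)
```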

Adversarial Training (AT)

```
python train.py --arch resnet18 \
    --dataset cifar10 \
    --robust \
    --pgd-norm-type l2 \
    --pgd-radius 0.5 \
    --pgd-random-start \
    --pgd-steps 10 \
    --pgd-step-size 0.125 \
    --save-dir ./cifar10-models/resnet18-adv-0.5 \
    --epoch 200
```
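The --pgd-* flags configure the inner maximization. As a rough guide to what they mean, here is a minimal L2 PGD sketch with the same parameter names (a standard PGD illustration, not the repository's exact implementation):

```python
import torch
import torch.nn.functional as F

def pgd_l2(model, x, y, radius=0.5, steps=10, step_size=0.125, random_start=True):
    # Standard L2 PGD: maximize the loss inside an L2 ball of the given radius.
    delta = torch.zeros_like(x)
    if random_start:
        # Random start on the sphere of the given radius (one common choice).
        delta = torch.randn_like(x)
        norms = delta.flatten(1).norm(dim=1).clamp_min(1e-12)
        delta = delta * (radius / norms).view(-1, 1, 1, 1)
    delta.requires_grad_(True)
    for _ in range(steps):
        loss = F.cross_entropy(model(x + delta), y)
        (grad,) = torch.autograd.grad(loss, delta)
        # Normalized-gradient ascent step, then project back into the ball.
        g = grad / grad.flatten(1).norm(dim=1).clamp_min(1e-12).view(-1, 1, 1, 1)
        with torch.no_grad():
            delta += step_size * g
            norms = delta.flatten(1).norm(dim=1).clamp_min(1e-12)
            delta *= (norms.clamp(max=radius) / norms).view(-1, 1, 1, 1)
    return (x + delta).detach()
```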

Jacobian Regularization (JR)

Install jacobian_regularizer first:

```
pip install git+https://github.com/facebookresearch/jacobian_regularizer
```

```
python train.py --arch resnet18 \
    --dataset cifar10 \
    --reg \
    --reg-type jr \
    --jr-beta 0.05 \
    --save-dir ./cifar10-models/resnet18-jr-0.05 \
    --epoch 200
```
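The objective this trains is cross-entropy plus a penalty on the Frobenius norm of the input-output Jacobian. A minimal sketch, assuming the installed library's JacobianReg interface and treating beta as the role of --jr-beta (our assumption, not the repository's code):

```python
import torch
import torch.nn.functional as F
from jacobian import JacobianReg  # provided by the package installed above

reg = JacobianReg()  # stochastic estimate of ||d f(x) / d x||_F^2

def jr_objective(model, x, y, beta=0.05):
    # The penalty differentiates w.r.t. the input, so x must require grad.
    x = x.clone().requires_grad_(True)
    out = model(x)
    return F.cross_entropy(out, y) + beta * reg(x, out)
```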

Input Regularization (IR)

```
python train.py --arch resnet18 \
    --dataset cifar10 \
    --reg \
    --reg-type ig \
    --ig-beta 0.1 \
    --save-dir ./cifar10-models/resnet18-ir-0.1 \
    --epoch 200
```
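Here --reg-type ig selects input gradient regularization, which penalizes the norm of the loss gradient with respect to the input. A minimal sketch of such an objective (a hypothetical implementation, with beta standing in for --ig-beta):

```python
import torch
import torch.nn.functional as F

def ig_objective(model, x, y, beta=0.1):
    # Penalize ||d L / d x||_2^2 so the loss is flat around the input.
    x = x.clone().requires_grad_(True)
    loss = F.cross_entropy(model(x), y)
    # create_graph=True makes the input gradient differentiable w.r.t. weights.
    (g,) = torch.autograd.grad(loss, x, create_graph=True)
    return loss + beta * g.pow(2).sum() / x.size(0)
```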

SAM & IR

```
python train.py --arch resnet18 \
    --dataset cifar10 \
    --reg \
    --reg-type ig \
    --ig-beta 0.1 \
    --sam \
    --rho 0.1 \
    --save-dir ./cifar10-models/resnet18-sam-0.1-ir-0.1 \
    --epoch 200
```
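Conceptually, this combined run applies SAM's perturb-then-descend update to the IG-regularized objective. Reusing the two hypothetical sketches above:

```python
# Combining the sketches above: feed the IG-regularized objective to SAM.
# Inside the training loop, where x, y are the current batch:
sam_step(model,
         lambda: ig_objective(model, x, y, beta=0.1),
         optimizer,
         rho=0.1)
```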

Hessian Computation

We use the get_hessian_eigenvalues_from_sample function in hessian.py to compute the dominant eigenvalue. Note that we set which='LM' to get the eigenvalue with the largest magnitude, and we report its absolute value in our paper.
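As a rough illustration of that computation (the repository's hessian.py may differ in details), the standard recipe is Lanczos on Hessian-vector products, e.g. via scipy.sparse.linalg.eigsh with which='LM':

```python
import numpy as np
import torch
import torch.nn.functional as F
from scipy.sparse.linalg import LinearOperator, eigsh

def dominant_hessian_eigenvalue(model, x, y):
    # Largest-magnitude Hessian eigenvalue via Lanczos on Hessian-vector
    # products; which='LM' mirrors the choice described above.
    params = [p for p in model.parameters() if p.requires_grad]
    n = sum(p.numel() for p in params)
    loss = F.cross_entropy(model(x), y)
    grads = torch.autograd.grad(loss, params, create_graph=True)
    flat_grad = torch.cat([g.reshape(-1) for g in grads])

    def hvp(v):
        # H v = d(g . v)/dw, computed without materializing H.
        v_t = torch.as_tensor(np.asarray(v).reshape(-1), dtype=flat_grad.dtype,
                              device=flat_grad.device)
        hv = torch.autograd.grad(flat_grad @ v_t, params, retain_graph=True)
        return torch.cat([h.reshape(-1) for h in hv]).detach().cpu().numpy()

    op = LinearOperator((n, n), matvec=hvp, dtype=np.float32)
    lam = eigsh(op, k=1, which='LM', return_eigenvectors=False)[0]
    return abs(lam)  # the paper reports the absolute value
```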

Owner

  • Name: CGCL-codes
  • Login: CGCL-codes
  • Kind: organization

CGCL/SCTS/BDTS Lab

GitHub Events

Total
  • Watch event: 5
Last Year
  • Watch event: 5