https://github.com/cgcl-codes/eclipse

This is the official code for the ESORICS 2024 paper "ECLIPSE: Expunging Clean-label Indiscriminate Poisons via Sparse Diffusion Purification"


Science Score: 23.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file: found
  • .zenodo.json file
  • DOI references
  • Academic publication links: arxiv.org, scholar.google
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity: low similarity (8.9%) to scientific vocabulary

Keywords

data-poisoning-attacks diffusion-models unlearnable-examples
Last synced: 5 months ago

Repository

This is the official code for the ESORICS 2024 paper "ECLIPSE: Expunging Clean-label Indiscriminate Poisons via Sparse Diffusion Purification"

Basic Info
  • Host: GitHub
  • Owner: CGCL-codes
  • License: mit
  • Language: Python
  • Default Branch: main
  • Homepage:
  • Size: 2.52 MB
Statistics
  • Stars: 2
  • Watchers: 3
  • Forks: 1
  • Open Issues: 0
  • Releases: 0
Topics
data-poisoning-attacks diffusion-models unlearnable-examples
Created over 1 year ago · Last pushed over 1 year ago
Metadata Files
Readme License

README.md

ECLIPSE

The official implementation of our ESORICS 2024 paper "ECLIPSE: Expunging Clean-label Indiscriminate Poisons via Sparse Diffusion Purification", by Xianlong Wang, Shengshan Hu, Yechao Zhang, Ziqi Zhou, Leo Yu Zhang, Peng Xu, Wei Wan, and Hai Jin.

ESORICS 2024 · Unlearnable Examples · Diffusion Models · Data Poisoning Attacks

Abstract

Clean-label indiscriminate poisoning attacks add invisible perturbations to correctly labeled training images, thus dramatically reducing the generalization capability of the victim models. Recently, some defense mechanisms have been proposed, such as adversarial training, image transformation techniques, and image purification. However, these schemes are either susceptible to adaptive attacks, built on unrealistic assumptions, or only effective against specific poison types, limiting their universal applicability. In this research, we propose a more universally effective, practical, and robust defense scheme called ECLIPSE. We first investigate the impact of Gaussian noise on the poisons and theoretically prove that any kind of poison will be largely assimilated when imposing sufficient random noise. In light of this, we assume the victim has access to an extremely limited number of clean images (a more practical scenario) and subsequently enlarge this sparse set for training a denoising probabilistic model (a universal denoising tool). We then introduce Gaussian noise to absorb the poisons and apply the model for denoising, resulting in a roughly purified dataset. Finally, to address the inconsistent assimilation sensitivity of different poisons to Gaussian noise, we propose a lightweight corruption compensation module to effectively eliminate residual poisons, providing a universal defense approach. Extensive experiments demonstrate that our defense approach outperforms 10 state-of-the-art defenses. We also propose an adaptive attack against ECLIPSE and verify the robustness of our defense scheme.
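For intuition, the two-stage purification described above (add Gaussian noise to absorb the poison, then run a pretrained diffusion model in reverse to denoise) can be sketched roughly as follows. This is an illustrative PyTorch sketch, not the repository's purification.py; the names `purify`, `denoiser`, and `t_star` are assumptions for illustration.

```python
# Illustrative sketch of the noise-then-denoise purification idea (NOT the
# repository's actual API): add Gaussian noise to absorb poison perturbations,
# then run a pretrained denoising diffusion model backwards from that noise
# level to recover a roughly purified image.
import torch

def purify(images: torch.Tensor, denoiser, t_star: int, num_steps: int = 1000,
           beta_min: float = 1e-4, beta_max: float = 0.02) -> torch.Tensor:
    """images: batch scaled to [-1, 1]; denoiser(x_t, t) predicts the noise eps.
    t_star: forward-diffusion step controlling how much Gaussian noise is added."""
    betas = torch.linspace(beta_min, beta_max, num_steps, device=images.device)
    alphas_bar = torch.cumprod(1.0 - betas, dim=0)

    # Forward step: x_t = sqrt(a_bar_t) * x_0 + sqrt(1 - a_bar_t) * eps
    a_bar = alphas_bar[t_star]
    x = a_bar.sqrt() * images + (1 - a_bar).sqrt() * torch.randn_like(images)

    # Reverse (DDPM-style) denoising from t_star back to 0.
    for t in range(t_star, -1, -1):
        a_t, a_bar_t = 1.0 - betas[t], alphas_bar[t]
        eps = denoiser(x, torch.full((x.shape[0],), t, device=x.device))
        mean = (x - betas[t] / (1 - a_bar_t).sqrt() * eps) / a_t.sqrt()
        noise = torch.randn_like(x) if t > 0 else torch.zeros_like(x)
        x = mean + betas[t].sqrt() * noise
    return x.clamp(-1.0, 1.0)
```

Here `denoiser(x_t, t)` stands in for a noise-prediction network such as the sparsely trained checkpoint used by this repository, and the choice of `t_star` trades off how much poison is absorbed against how much image detail is lost.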

Latest Update

| Date | Event |
|------------|----------|
| 2024/06/17 | We have released the paper of ECLIPSE! |
| 2024/06/16 | We have released the implementation of ECLIPSE! |
| 2024/06/14 | ECLIPSE is accepted by ESORICS 2024! |

Start Running ECLIPSE

  • Get code

    ```shell
    git clone https://github.com/CGCL-codes/ECLIPSE.git
    ```

  • Build environment

    ```shell
    cd ECLIPSE
    conda create -n ECLIPSE python=3.9
    conda activate ECLIPSE
    pip install -r requirements.txt
    ```

  • Sparse Diffusion Purification Stage


  • Route 1: Purify the poisoned datasets yourself through the following three steps:

  • Download the sparsely trained diffusion model checkpoint

    • Please download the diffusion checkpoint (a diffusion model trained on 2,000 randomly selected CIFAR-10 test images, M=4, I=250K) at: ema0.9999250000.pt. Note that our diffusion training process is based on the Improved diffusion repository.
    • Save this checkpoint in ECLIPSE/diff_ckpt/cifar10/test2000ps8000
  • Download the clean-label indiscriminate poisoned datasets (unlearnable datasets)

    • Please download 8 types of CIFAR-10 unlearnable datasets at unlearnable-datasets.
    • Save these poisoned datasets in ECLIPSE/poisoned_data/cifar10
  • Perform diffusion purification on the poisoned datasets, e.g., the EM dataset

    ```shell
    python purification.py --poison EM
    ```


  • Route 2: Alternatively, you can directly use the purified datasets:

    • Please download 8 types of CIFAR-10 purified datasets at purified-datasets.
    • Save these purified datasets in ECLIPSE/purified_data/cifar10/test2000ps8000/100/250000
    • Note: The poisoned and purified datasets for the model-dependent unlearnable schemes (EM, REM, EFP, and SEP) are all crafted based on the ResNet18 architecture.
  • Perform Training on Purified Datasets with the Lightweight Corruption Compensation Module

    ```shell
    python train.py --poison EM --arch resnet18 --pure
    ```

    Note that the final test accuracy indicates the defense effectiveness of ECLIPSE. A hedged sketch of this training step is given after this list.
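The command above runs the full training; purely for orientation, here is a minimal, hedged sketch of the idea of training a ResNet18 on the purified images while applying an extra lightweight corruption. The specific corruption below (random grayscale) and the stand-in torchvision CIFAR-10 loader are assumptions for illustration only; the actual corruption compensation module is implemented in this repository's train.py.

```python
# Minimal sketch only -- the real logic lives in this repo's train.py.
# Assumption: the "lightweight corruption" is modeled here as random grayscale;
# the purified images are stood in for by torchvision's CIFAR-10 download.
import torch
import torch.nn as nn
import torchvision
import torchvision.transforms as T

device = "cuda" if torch.cuda.is_available() else "cpu"

compensate = T.Compose([
    T.RandomCrop(32, padding=4),
    T.RandomHorizontalFlip(),
    T.RandomGrayscale(p=0.5),        # assumed lightweight corruption step
    T.ToTensor(),
])

# In the real pipeline the images would come from ECLIPSE/purified_data/cifar10/...
train_set = torchvision.datasets.CIFAR10("./data", train=True, download=True,
                                         transform=compensate)
loader = torch.utils.data.DataLoader(train_set, batch_size=128, shuffle=True)

model = torchvision.models.resnet18(num_classes=10).to(device)
optimizer = torch.optim.SGD(model.parameters(), lr=0.1, momentum=0.9,
                            weight_decay=5e-4)
criterion = nn.CrossEntropyLoss()

model.train()
for images, labels in loader:        # one epoch of standard supervised training
    images, labels = images.to(device), labels.to(device)
    optimizer.zero_grad()
    loss = criterion(model(images), labels)
    loss.backward()
    optimizer.step()
```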

Acknowledgements

Some of our code is built upon diffpure.

BibTex

If you find ECLIPSE both interesting and helpful, please consider citing us in your research or publications:

```bibtex
@inproceedings{wang2024eclipse,
  title={ECLIPSE: Expunging Clean-label Indiscriminate Poisons via Sparse Diffusion Purification},
  author={Wang, Xianlong and Hu, Shengshan and Zhang, Yechao and Zhou, Ziqi and Zhang, Leo Yu and Xu, Peng and Wan, Wei and Jin, Hai},
  booktitle={Proceedings of the 29th European Symposium on Research in Computer Security (ESORICS'24)},
  year={2024}
}
```

Owner

  • Name: CGCL-codes
  • Login: CGCL-codes
  • Kind: organization

CGCL/SCTS/BDTS Lab

GitHub Events

Total
  • Watch event: 3
  • Fork event: 1
Last Year
  • Watch event: 3
  • Fork event: 1

Dependencies

requirements.txt pypi
  • pyyaml ==6.0.1
  • torch ==2.3.0
  • torchvision ==0.18.0
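
As an optional sanity check (not part of the repository), the installed versions can be compared against these pins:

```python
# Optional sanity check, not part of the repository: compare installed package
# versions against the pins listed in requirements.txt.
from importlib.metadata import version

pins = {"PyYAML": "6.0.1", "torch": "2.3.0", "torchvision": "0.18.0"}
for pkg, pinned in pins.items():
    installed = version(pkg)
    status = "OK" if installed == pinned else "MISMATCH"
    print(f"{pkg}: installed {installed}, pinned {pinned} -> {status}")
```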