Is Adversarial Training Really a Silver Bullet for Mitigating Data Poisoning?

This repository contains the PyTorch implementation of our ICLR 2023 paper titled "Is Adversarial Training Really a Silver Bullet for Mitigating Data Poisoning?".

Usage

Step 1: Train the reference model using clean data

python train_reference_model.py --robust_eps 4 --reference_path ./reference_model

You can control the robustness of the reference model by ajusting --robust_eps parameter. The reference model will be saved at --reference_path. This file can also be used to evaluate the attack performance, with poisoned data as input.

Step 2: Calculate the centroid for each class

python get_centroid.py --centroid_path ./centroid

The centroid will be saved at --centroid_path

Step 3: Generate poisons

python poison_generate.py --eps 8 --recipe push

The poison budget can be controlled by adjusting --eps. You can select the poisoning method by setting --recipe push corresponding to EntF-Push or --recipe pull corresponding to EntF-Pull.

Citation

@inproceedings{wen2023is,
    title={Is Adversarial Training Really a Silver Bullet for Mitigating Data Poisoning?},
    author={Rui Wen and Zhengyu Zhao and Zhuoran Liu and Michael Backes and Tianhao Wang and Yang Zhang},
    booktitle={International Conference on Learning Representations},
    year={2023},
    url={https://openreview.net/forum?id=zKvm1ETDOq}
}

Contact

If you are interested in our work, feel free to drop me an email at rui.wen@cispa.de

Acknowledgement

We would like to acknowledge the work of Fowl et al. for their excellent framework for generating poisons based on adversarial examples. Our code leverages their code for the gradient descent and poison saving parts.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
imgs		imgs
models		models
utils		utils
village		village
LICENSE		LICENSE
README.md		README.md
get_centroid.py		get_centroid.py
poison_generate.py		poison_generate.py
train_reference_model.py		train_reference_model.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Is Adversarial Training Really a Silver Bullet for Mitigating Data Poisoning?

Usage

Step 1: Train the reference model using clean data

Step 2: Calculate the centroid for each class

Step 3: Generate poisons

Citation

Contact

Acknowledgement

About

Releases

Packages

Contributors 2

Languages

License

WenRuiUSTC/EntF

Folders and files

Latest commit

History

Repository files navigation

Is Adversarial Training Really a Silver Bullet for Mitigating Data Poisoning?

Usage

Step 1: Train the reference model using clean data

Step 2: Calculate the centroid for each class

Step 3: Generate poisons

Citation

Contact

Acknowledgement

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages