Skip to content
/ DGSlow Public

Codebase for the ACL 2023 paper: White-Box Multi-Objective Adversarial Attack on Dialogue Generation.

Notifications You must be signed in to change notification settings

yul091/DGSlow

Repository files navigation

DGSlow

Codebase for the ACL 2023 paper: "White-Box Multi-Objective Adversarial Attack on Dialogue Generation" (PDF).

Quickstart

Setup Environment

  • python 3.10.8
  • pytorch 1.13.0+
  • Install dependencies
pip install -r requirements.txt

Train and evaluate a model on a specific task(s)

  • BART in Blended Skill Talk:
python train_seq2seq.py --model_name_or_path facebook/bart-base --dataset blended_skill_talk --output_dir results/bart-base
  • DialoGPT in Empathetic Dialogues:
python train_clm.py --model_name_or_path microsoft/DialoGPT-small --dataset empathetic_dialogues --output_dir results/dialogpt-small

Attack a pre-trained model

  • Structure attack on DialoGPT-small in Blended Skill Talk:
python attack.py --attack_strategy structure --model_name_or_path results/bart-base --dataset blended_skill_talk
  • DF attack on bart-base in Empathetic Dialogues:
python attack.py --attack_strategy FD --model_name_or_path results/bart-base --dataset empathetic_dialogues

Transfer attack

  • Transfer attack from DialoGPT-small to bart-base in Blended Skill Talk:
python eval.py --file ${FILE} --orig_model bart-base --victim_model dialogpt-small --dataset BST --out_dir logging

Citation

Please cite the paper in your publications if you find this repo useful:

@inproceedings{li2023white,
  title={White-Box Multi-Objective Adversarial Attack on Dialogue Generation},
  author={Li, Yufei and Li, Zexin and Gao, Yingfan and Liu, Cong},
  booktitle={Annual Meeting of the Association for Computational Linguistics (ACL)},
  year={2023}
}

Acknowledgement

Our implementation is based on OpenAttack. We would like to thank the authors for their open source code.

About

Codebase for the ACL 2023 paper: White-Box Multi-Objective Adversarial Attack on Dialogue Generation.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages