DiscreteZoo

This repository is a fork of both the TextAttack and the TextAttack-Search-Benchmark repositories. The code in this repository is for the paper Don't Search for a Search Method, to appear at EMNLP 2021.

Setup

How to run

To reproduce all the experiments in the paper, run the python file grid_run_all.py. All results will be written to the folder grid_results.

Attacks and how to design a new attack

The attack_one method in an Attack takes as input an AttackedText, and outputs either a SuccessfulAttackResult if it succeeds or a FailedAttackResult if it fails.

TextAttack formulates an attack as consisting of four components: a goal function which determines if the attack has succeeded, constraints defining which perturbations are valid, a transformation that generates potential modifications given an input, and a search method which traverses through the search space of possible perturbations. The attack attempts to perturb an input text such that the model output fulfills the goal function (i.e., indicating whether the attack is successful) and the perturbation adheres to the set of constraints (e.g., grammar constraint, semantic similarity constraint). A search method is used to find a sequence of transformations that produce a successful adversarial example.

The attacks for the paper are implemented in the folder textattack/search_methods.

Goal Functions

A GoalFunction takes as input an AttackedText object, scores it, and determines whether the attack has succeeded, returning a GoalFunctionResult.

Constraints

A Constraint takes as input a current AttackedText, and a list of transformed AttackedTexts. For each transformed option, it returns a boolean representing whether the constraint is met.

Transformations

A Transformation takes as input an AttackedText and returns a list of possible transformed AttackedTexts. For example, a transformation might return all possible synonym replacements.

Search Methods

A SearchMethod takes as input an initial GoalFunctionResult and returns a final GoalFunctionResult The search is given access to the get_transformations function, which takes as input an AttackedText object and outputs a list of possible transformations filtered by meeting all of the attack’s constraints. A search consists of successive calls to get_transformations until the search succeeds (determined using get_goal_results) or is exhausted.

Citing our work

If you use our attacks or baselines for your research, please cite Don't Search for a Search Method

@misc{berger2021dont,
      title={Don't Search for a Search Method -- Simple Heuristics Suffice for Adversarial Text Attacks}, 
      author={Nathaniel Berger and Stefan Riezler and Artem Sokolov and Sebastian Ebert},
      year={2021},
      eprint={2109.07926},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}

Because this work is based on TextAttack, if you use TextAttack for your research, please cite the original authors of the framework. TextAttack: A Framework for Adversarial Attacks, Data Augmentation, and Adversarial Training in NLP.

@misc{morris2020textattack,
    title={TextAttack: A Framework for Adversarial Attacks, Data Augmentation, and Adversarial Training in NLP},
    author={John X. Morris and Eli Lifland and Jin Yong Yoo and Jake Grigsby and Di Jin and Yanjun Qi},
    year={2020},
    eprint={2005.05909},
    archivePrefix={arXiv},
    primaryClass={cs.CL}
}

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
analysis		analysis
docs		docs
figures		figures
recipes		recipes
tests		tests
textattack		textattack
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
autoevaluation.ipynb		autoevaluation.ipynb
avg_neighbor_count.py		avg_neighbor_count.py
budget_graphs.sh		budget_graphs.sh
greedy_word_swap_lax_snli.csv		greedy_word_swap_lax_snli.csv
grid_run_all.py		grid_run_all.py
neighbor_counts_bargraph.svg		neighbor_counts_bargraph.svg
run_experiment.py		run_experiment.py
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

DiscreteZoo

Setup

How to run

Attacks and how to design a new attack

Goal Functions

Constraints

Transformations

Search Methods

Citing our work

About

Uh oh!

Releases

Packages

Languages

License

StatNLP/discretezoo

Folders and files

Latest commit

History

Repository files navigation

DiscreteZoo

Setup

How to run

Attacks and how to design a new attack

Goal Functions

Constraints

Transformations

Search Methods

Citing our work

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages