Tree of Attacks #446
…ttacks. Updated requirements.txt to include numpy.
Need to test the code, run generation, and write probes.
… ast.literal_eval with regular expression. Add small handful of jailbreaks successful against gpt-3.5-turbo and gpt-4.
replaced the pathlib call w/ garak._config.basedir. also, is there an interface to running full TAP locally via the attack manager? it looks like TAPProbe uses a cached set, which is great for getting off the ground, but it'd be good to include a probe that runs full TAP
I'll update the probe with a
… as a parameter in run_tap function. Rename current TAPProbe to TapCachedProbe. Add TAPProbe and PAIRProbe.
lgtm!
the config route doesn't offer enough bandwidth for this plugin and needs revision; let's track that in another issue/PR
Initial commit containing full, untested implementation for tree of attacks.
Updated requirements.txt to include numpy.