This is the project page for the paper "SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal Behaviors".
forked from LLM-Tuning-Safety/LLM-Tuning-Safety.github.io
SORRY-Bench/sorry-bench.github.io
Languages
- HTML 62.4%
- Jupyter Notebook 32.1%
- CSS 3.9%
- JavaScript 1.5%
- Python 0.1%