SORRY-Bench/sorry-bench.github.io

 
 

SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal Behaviors

This repository hosts the project page for the paper "SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal Behaviors".

Languages

  • HTML 62.4%
  • Jupyter Notebook 32.1%
  • CSS 3.9%
  • JavaScript 1.5%
  • Python 0.1%