thu-coai
Conversational AI groups from Tsinghua University
Repositories
Showing 10 of 86 repositories
- ComplexBench Public
- JailbreakDefense_GoalPriority Public
[ACL 2024] Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization
- SafeUnlearning Public
Safe Unlearning: A Surprisingly Effective and Generalizable Solution to Defend Against Jailbreak Attacks
- CritiqueLLM Public