Repositories
Showing 10 of 67 repositories
git-disl/awesome_LLM-harmful-fine-tuning-papers’s past year of commit activity
109
2
0
0
Updated Dec 26, 2024
git-disl/awesome-LLM-game-agent-papers’s past year of commit activity
git-disl/PFT’s past year of commit activity
Python
1
0
0
0
Updated Dec 6, 2024
git-disl/Chameleon’s past year of commit activity
Python
3
0
1
0
Updated Nov 18, 2024
Vaccine
Public
This is the official code for the paper "Vaccine: Perturbation-aware Alignment for Large Language Models" (NeurIPS2024)
git-disl/Vaccine’s past year of commit activity
Shell
27
Apache-2.0
4
0
0
Updated Nov 18, 2024
Booster
Public
This is the official code for the paper "Booster: Tackling Harmful Fine-tuning for Large Language Models via Attenuating Harmful Perturbation".
git-disl/Booster’s past year of commit activity
Shell
8
Apache-2.0
0
0
0
Updated Nov 11, 2024
git-disl/PokeLLMon’s past year of commit activity
Python
178
14
1
0
Updated Oct 12, 2024
git-disl/llm-topla’s past year of commit activity
Jupyter Notebook
5
0
1
0
Updated Oct 4, 2024
Lisa
Public
This is the official code for the paper "Lazy Safety Alignment for Large Language Models against Harmful Fine-tuning" (NeurIPS2024)
git-disl/Lisa’s past year of commit activity
Python
12
Apache-2.0
0
0
0
Updated Sep 10, 2024
EnsembleBench
Public
A holistic framework for promoting high diversity ensemble learning.
git-disl/EnsembleBench’s past year of commit activity
Python
14
2
0
0
Updated Aug 27, 2024
You can’t perform that action at this time.