Skip to content
@git-disl

git-disl

Pinned Loading

  1. PokeLLMon PokeLLMon Public

    Python 178 14

Repositories

Showing 10 of 67 repositories
  • awesome_LLM-harmful-fine-tuning-papers Public

    A survey on harmful fine-tuning attack for large language model

    git-disl/awesome_LLM-harmful-fine-tuning-papers’s past year of commit activity
    109 2 0 0 Updated Dec 26, 2024
  • awesome-LLM-game-agent-papers Public

    A Survey on Large Language Model-Based Game Agents

    git-disl/awesome-LLM-game-agent-papers’s past year of commit activity
    380 14 1 0 Updated Dec 8, 2024
  • PFT Public
    git-disl/PFT’s past year of commit activity
    Python 1 0 0 0 Updated Dec 6, 2024
  • Chameleon Public
    git-disl/Chameleon’s past year of commit activity
    Python 3 0 1 0 Updated Nov 18, 2024
  • Vaccine Public

    This is the official code for the paper "Vaccine: Perturbation-aware Alignment for Large Language Models" (NeurIPS2024)

    git-disl/Vaccine’s past year of commit activity
    Shell 27 Apache-2.0 4 0 0 Updated Nov 18, 2024
  • Booster Public

    This is the official code for the paper "Booster: Tackling Harmful Fine-tuning for Large Language Models via Attenuating Harmful Perturbation".

    git-disl/Booster’s past year of commit activity
    Shell 8 Apache-2.0 0 0 0 Updated Nov 11, 2024
  • PokeLLMon Public
    git-disl/PokeLLMon’s past year of commit activity
    Python 178 14 1 0 Updated Oct 12, 2024
  • llm-topla Public
    git-disl/llm-topla’s past year of commit activity
    Jupyter Notebook 5 0 1 0 Updated Oct 4, 2024
  • Lisa Public

    This is the official code for the paper "Lazy Safety Alignment for Large Language Models against Harmful Fine-tuning" (NeurIPS2024)

    git-disl/Lisa’s past year of commit activity
    Python 12 Apache-2.0 0 0 0 Updated Sep 10, 2024
  • EnsembleBench Public

    A holistic framework for promoting high diversity ensemble learning.

    git-disl/EnsembleBench’s past year of commit activity
    Python 14 2 0 0 Updated Aug 27, 2024