
EBench

Released code for the paper "A Unified Framework for Benchmarking Social Group Bias in Large Language Models".

Installation

We assume conda / miniconda is already installed. Install the dependencies through conda:

# clone our repo
git clone https://github.com/Hongji1001/llm-bias.git
cd llm-bias
# remove the old environment if necessary
conda deactivate; conda env remove --name bias-benchmark
# create and activate the environment
conda env create -f env.yaml --name bias-benchmark
conda activate bias-benchmark

Note: Before evaluating generation outputs, you should download the hard-debiased version of Google's Word2Vec embeddings trained on Google News here.
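
If you want to sanity-check the downloaded embeddings before running the evaluation, here is a minimal sketch using gensim (the file name below is an assumption; use the path of your actual download):

# minimal sketch: load the hard-debiased Word2Vec embeddings with gensim
# (requires `pip install gensim`; the file name is an assumption, adjust to your download)
from gensim.models import KeyedVectors

vectors = KeyedVectors.load_word2vec_format(
    "GoogleNews-vectors-negative300-hard-debiased.bin", binary=True
)
print(vectors["king"].shape)  # expect a 300-dimensional vector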

Benchmark on Classification

# You can modify setting.py to benchmark your own models.
python3 classification_benchmark.py  # few-shot evaluation
python3 classification_benchmark.py --eval_only  # zero-shot evaluation
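
The exact contents of setting.py are specific to this repo; as a purely illustrative sketch, it might hold a list of Hugging Face model identifiers to benchmark. The variable name below is hypothetical, so check the actual file before editing:

# hypothetical sketch of a setting.py entry; the variable name is an
# assumption for illustration, not the repo's actual schema
MODELS = [
    "facebook/opt-1.3b",
    "meta-llama/Llama-2-7b-chat-hf",
    "lmsys/vicuna-7b-v1.5",
]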

Benchmark on Generation

# You can modify setting.py to benchmark your own models.
python3 generation_benchmark.py --gen_only   # generate completions only
python3 generation_benchmark.py --eval_only  # evaluate the completions stored at --completions_path
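
The datasets (and typically the generated completions) are stored as JSONL, one JSON object per line. If you want to peek at a file before evaluating, here is a minimal sketch that assumes only the JSONL layout and prints each record's keys rather than guessing the schema:

# minimal sketch: inspect the first few records of a JSONL file
# (data/Ebench_cnn_dailymail.jsonl is taken from the commands in this README;
# the record schema is unknown here, so we just print the keys)
import json

with open("data/Ebench_cnn_dailymail.jsonl") as f:
    for i, line in enumerate(f):
        record = json.loads(line)
        print(sorted(record.keys()))
        if i >= 2:  # stop after three records
            break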

# To reproduce the mitigation experiment, you should run the following:
bash scripts/debias_generation_model.sh meta-llama/Llama-2-7b-chat-hf checkpoint/meta-llama/Llama-2-7b-chat-hf data/Ebench_cnn_dailymail.jsonl 0,1,2,3
python3 generation_benchmark.py --model_name_or_path checkpoint/meta-llama/Llama-2-7b-chat-hf

Dataset Collection

# You can re-generate the datasets mentioned in our paper as follows
# (choose one corpus: bookcorpus or cnn_dailymail):
python3 scripts/generate_dataset.py 20 10 40 bookcorpus text

Debias

cd llm-unlearning
conda create --name unlearn python=3.11
conda activate unlearn
pip install -r requirements.txt
mkdir logs checkpoint
# debias OPT-1.3b
bash scripts/debias_generation_model.sh facebook/opt-1.3b checkpoint/facebook-opt-1.3b_unlearn
# debias Llama-2-7b-chat
bash scripts/debias_generation_model.sh meta-llama/Llama-2-7b-chat-hf checkpoint/meta-llama-Llama-2-7b-chat-hf_unlearn

Benchmark

python benchmark.py --model_name_or_path facebook/opt-1.3b --gen_only --dataset_paths "data/Ebench_bookcorpus.jsonl" "data/Ebench_cnn_dailymail.jsonl" "BOLD.jsonl"
python benchmark.py --model_name_or_path lmsys/vicuna-7b-v1.5 --gen_only --dataset_paths "data/Ebench_bookcorpus.jsonl" "data/Ebench_cnn_dailymail.jsonl" "BOLD.jsonl"
python benchmark.py --model_name_or_path mistralai/Mistral-7B-Instruct-v0.3 --gen_only --dataset_paths "data/Ebench_bookcorpus.jsonl" "data/Ebench_cnn_dailymail.jsonl" "BOLD.jsonl"

# after debiasing
python benchmark.py --model_name_or_path checkpoint/facebook-opt-1.3b_unlearn --gen_only --dataset_paths "data/Ebench_bookcorpus.jsonl" "data/Ebench_cnn_dailymail.jsonl" "BOLD.jsonl"
python benchmark.py --model_name_or_path checkpoint/meta-llama-Llama-2-7b-chat-hf_unlearn --gen_only --dataset_paths "data/Ebench_bookcorpus.jsonl" "data/Ebench_cnn_dailymail.jsonl" "BOLD.jsonl"
