Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Add FuseChat-3.0 models to AlpacaEval
Dear AlpacaEval Team,

We are researchers from FuseAI, and we are excited to create a pull request to contribute our latest FuseChat-3.0 models to your leaderboard. Our submission includes the following models:

- **FuseChat-Llama-3.1-8B-Instruct**
- **FuseChat-Llama-3.2-3B-Instruct**
- **FuseChat-Llama-3.2-1B-Instruct**
- **FuseChat-Gemma-2-9B-Instruct**
- **FuseChat-Qwen-2.5-7B-Instruct**

For more details about our models, please refer to the following resources:

- **Hugging Face Collection**: [FuseChat-3.0](https://huggingface.co/collections/FuseAI/fusechat-30-6752d18dec430bad7a236a75)
- **Paper**: [WRPO](https://arxiv.org/abs/2412.03187)
- **Blog**: [FuseChat-3.0 Blog](https://slit-ai.github.io/FuseChat-3.0/)
- **GitHub Repo**: [FuseChat-3.0 Repository](https://github.com/SLIT-AI/FuseChat-3.0)

Thank you for your consideration! Please let us know if you need any modification for this PR.

Best regards,
Yang
On behalf of the FuseAI Team
Showing 20 changed files with 64,540 additions and 0 deletions.
4,832 changes: 4,832 additions & 0 deletions
results/FuseChat-Gemma-2-9B-Instruct/model_outputs.json
Large diffs are not rendered by default.
8,052 changes: 8,052 additions & 0 deletions
results/FuseChat-Gemma-2-9B-Instruct/weighted_alpaca_eval_gpt4_turbo/annotations.json
Large diffs are not rendered by default.
4,832 changes: 4,832 additions & 0 deletions
results/FuseChat-Llama-3.1-8B-Instruct/model_outputs.json
Large diffs are not rendered by default.
8,052 changes: 8,052 additions & 0 deletions
results/FuseChat-Llama-3.1-8B-Instruct/weighted_alpaca_eval_gpt4_turbo/annotations.json
Large diffs are not rendered by default.
4,832 changes: 4,832 additions & 0 deletions
results/FuseChat-Llama-3.2-1B-Instruct/model_outputs.json
Large diffs are not rendered by default.
8,052 changes: 8,052 additions & 0 deletions
results/FuseChat-Llama-3.2-1B-Instruct/weighted_alpaca_eval_gpt4_turbo/annotations.json
Large diffs are not rendered by default.
4,832 changes: 4,832 additions & 0 deletions
results/FuseChat-Llama-3.2-3B-Instruct/model_outputs.json
Large diffs are not rendered by default.
8,052 changes: 8,052 additions & 0 deletions
results/FuseChat-Llama-3.2-3B-Instruct/weighted_alpaca_eval_gpt4_turbo/annotations.json
Large diffs are not rendered by default.
4,832 changes: 4,832 additions & 0 deletions
results/FuseChat-Qwen-2.5-7B-Instruct/model_outputs.json
Large diffs are not rendered by default.
8,052 changes: 8,052 additions & 0 deletions
results/FuseChat-Qwen-2.5-7B-Instruct/weighted_alpaca_eval_gpt4_turbo/annotations.json
Large diffs are not rendered by default.
17 changes: 17 additions & 0 deletions
src/alpaca_eval/models_configs/FuseChat-Gemma-2-9B-Instruct/configs.yaml
@@ -0,0 +1,17 @@
FuseChat-Gemma-2-9B-Instruct:
  completions_kwargs:
    model_name: FuseAI/FuseChat-Gemma-2-9B-Instruct
    model_kwargs:
      dtype: bfloat16
    max_new_tokens: 4096
    temperature: 0.5
    top_p: 1.0
    batch_size: 1000
    use_beam_search: true
    stop_token_ids:
    - 1
    - 107
  fn_completions: vllm_local_completions
  pretty_name: FuseChat-Gemma-2-9B-Instruct
  prompt_template: FuseChat-Gemma-2-9B-Instruct/prompt.txt
  link: https://huggingface.co/FuseAI/FuseChat-Gemma-2-9B-Instruct
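Reviewer note: a quick structural check of an entry like the one above can catch missing keys before an evaluation run. The following is a minimal sketch, not part of AlpacaEval. It assumes the usual nesting in which the sampling settings sit under `completions_kwargs`, uses a hand-written dict in place of loading `configs.yaml`, and the `REQUIRED_KEYS` set and `check_model_config` helper are hypothetical names chosen for illustration.

```python
# Hypothetical in-memory copy of the Gemma entry; a real check would load configs.yaml.
GEMMA_CONFIG = {
    "FuseChat-Gemma-2-9B-Instruct": {
        "completions_kwargs": {
            "model_name": "FuseAI/FuseChat-Gemma-2-9B-Instruct",
            "model_kwargs": {"dtype": "bfloat16"},
            "max_new_tokens": 4096,
            "temperature": 0.5,
            "top_p": 1.0,
            "batch_size": 1000,
            "use_beam_search": True,
            "stop_token_ids": [1, 107],
        },
        "fn_completions": "vllm_local_completions",
        "pretty_name": "FuseChat-Gemma-2-9B-Instruct",
        "prompt_template": "FuseChat-Gemma-2-9B-Instruct/prompt.txt",
        "link": "https://huggingface.co/FuseAI/FuseChat-Gemma-2-9B-Instruct",
    }
}

# Assumption: these are the keys a reviewer wants present in every entry.
REQUIRED_KEYS = {"completions_kwargs", "fn_completions", "pretty_name", "prompt_template"}


def check_model_config(config: dict) -> list[str]:
    """Return a list of human-readable problems; an empty list means the entry looks sane."""
    problems = []
    if len(config) != 1:
        problems.append("expected exactly one top-level model entry")
    for name, entry in config.items():
        for key in sorted(REQUIRED_KEYS - set(entry)):
            problems.append(f"{name}: missing key {key!r}")
        kwargs = entry.get("completions_kwargs", {})
        if kwargs.get("temperature", 1.0) < 0:
            problems.append(f"{name}: temperature must be non-negative")
        if not 0 < kwargs.get("top_p", 1.0) <= 1:
            problems.append(f"{name}: top_p must be in (0, 1]")
        if not all(isinstance(t, int) for t in kwargs.get("stop_token_ids", [])):
            problems.append(f"{name}: stop_token_ids must be integers")
    return problems


if __name__ == "__main__":
    print(check_model_config(GEMMA_CONFIG))
```

The same helper could be pointed at the other four entries in this commit, since they share the same top-level keys.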
3 changes: 3 additions & 0 deletions
src/alpaca_eval/models_configs/FuseChat-Gemma-2-9B-Instruct/prompt.txt
@@ -0,0 +1,3 @@
<start_of_turn>user
{instruction}<end_of_turn>
<start_of_turn>model
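Reviewer note: the `{instruction}` placeholder in this template is filled with each benchmark instruction before generation. The sketch below illustrates that substitution under two assumptions: the file has no trailing newline, and a literal string replacement is used so that braces inside an instruction are left alone. `fill_prompt` is a hypothetical helper, not AlpacaEval's own function.

```python
# The three template lines from prompt.txt, joined with newlines (no trailing newline assumed).
GEMMA_TEMPLATE = "\n".join(
    [
        "<start_of_turn>user",
        "{instruction}<end_of_turn>",
        "<start_of_turn>model",
    ]
)


def fill_prompt(template: str, instruction: str) -> str:
    """Substitute the instruction into the template without interpreting other braces."""
    # str.replace is used instead of str.format so that an instruction such as
    # "Explain {x} in JSON" does not raise a KeyError.
    return template.replace("{instruction}", instruction)


if __name__ == "__main__":
    print(fill_prompt(GEMMA_TEMPLATE, "List three uses of {curly} braces."))
```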
21 changes: 21 additions & 0 deletions
src/alpaca_eval/models_configs/FuseChat-Llama-3.1-8B-Instruct/configs.yaml
@@ -0,0 +1,21 @@
FuseChat-Llama-3.1-8B-Instruct:
  completions_kwargs:
    model_name: FuseAI/FuseChat-Llama-3.1-8B-Instruct
    model_kwargs:
      dtype: bfloat16
    max_new_tokens: 4096
    temperature: 0.6
    top_k: 50
    top_p: 0.9
    presence_penalty: 0.1
    frequency_penalty: 0.1
    batch_size: 1000
    use_beam_search: true
    stop_token_ids:
    - 128001
    - 128009
  fn_completions: vllm_local_completions
  pretty_name: FuseChat-Llama-3.1-8B-Instruct
  prompt_template: FuseChat-Llama-3.1-8B-Instruct/prompt.txt
  link: https://huggingface.co/FuseAI/FuseChat-Llama-3.1-8B-Instruct

4 changes: 4 additions & 0 deletions
src/alpaca_eval/models_configs/FuseChat-Llama-3.1-8B-Instruct/prompt.txt
@@ -0,0 +1,4 @@
<|begin_of_text|><|start_header_id|>user<|end_header_id|>

{instruction}<|eot_id|><|start_header_id|>assistant<|end_header_id|>

21 changes: 21 additions & 0 deletions
src/alpaca_eval/models_configs/FuseChat-Llama-3.2-1B-Instruct/configs.yaml
@@ -0,0 +1,21 @@
FuseChat-Llama-3.2-1B-Instruct:
  completions_kwargs:
    model_name: FuseAI/FuseChat-Llama-3.2-1B-Instruct
    model_kwargs:
      dtype: bfloat16
    max_new_tokens: 4096
    temperature: 0.6
    top_k: 50
    top_p: 0.9
    presence_penalty: 0.1
    frequency_penalty: 0.1
    batch_size: 1000
    use_beam_search: true
    stop_token_ids:
    - 128001
    - 128009
  fn_completions: vllm_local_completions
  pretty_name: FuseChat-Llama-3.2-1B-Instruct
  prompt_template: FuseChat-Llama-3.1-8B-Instruct/prompt.txt
  link: https://huggingface.co/FuseAI/FuseChat-Llama-3.2-1B-Instruct

21 changes: 21 additions & 0 deletions
src/alpaca_eval/models_configs/FuseChat-Llama-3.2-3B-Instruct/configs.yaml
@@ -0,0 +1,21 @@
FuseChat-Llama-3.2-3B-Instruct:
  completions_kwargs:
    model_name: FuseAI/FuseChat-Llama-3.2-3B-Instruct
    model_kwargs:
      dtype: bfloat16
    max_new_tokens: 4096
    temperature: 0.6
    top_k: 50
    top_p: 0.9
    presence_penalty: 0.1
    frequency_penalty: 0.1
    batch_size: 1000
    use_beam_search: true
    stop_token_ids:
    - 128001
    - 128009
  fn_completions: vllm_local_completions
  pretty_name: FuseChat-Llama-3.2-3B-Instruct
  prompt_template: FuseChat-Llama-3.1-8B-Instruct/prompt.txt
  link: https://huggingface.co/FuseAI/FuseChat-Llama-3.2-3B-Instruct

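Reviewer note: the three Llama entries all point `prompt_template` at `FuseChat-Llama-3.1-8B-Instruct/prompt.txt`, which is why this commit adds no separate prompt files for the 1B and 3B models. The sketch below makes that sharing explicit. It assumes that `prompt_template` is resolved relative to `src/alpaca_eval/models_configs`; `resolve_prompt_path` and `models_by_template` are hypothetical helpers written for illustration.

```python
from collections import defaultdict
from pathlib import PurePosixPath

# Assumption: prompt_template values are relative to this directory.
MODELS_CONFIGS = PurePosixPath("src/alpaca_eval/models_configs")

# prompt_template values copied from the three Llama configs in this commit.
PROMPT_TEMPLATES = {
    "FuseChat-Llama-3.1-8B-Instruct": "FuseChat-Llama-3.1-8B-Instruct/prompt.txt",
    "FuseChat-Llama-3.2-1B-Instruct": "FuseChat-Llama-3.1-8B-Instruct/prompt.txt",
    "FuseChat-Llama-3.2-3B-Instruct": "FuseChat-Llama-3.1-8B-Instruct/prompt.txt",
}


def resolve_prompt_path(model: str) -> PurePosixPath:
    """Return the repository path of the prompt file a model entry refers to."""
    return MODELS_CONFIGS / PROMPT_TEMPLATES[model]


def models_by_template() -> dict[str, list[str]]:
    """Group model names by the prompt file they resolve to."""
    groups = defaultdict(list)
    for model in sorted(PROMPT_TEMPLATES):
        groups[str(resolve_prompt_path(model))].append(model)
    return dict(groups)


if __name__ == "__main__":
    for path, models in models_by_template().items():
        print(path, "<-", ", ".join(models))
```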
18 changes: 18 additions & 0 deletions
src/alpaca_eval/models_configs/FuseChat-Qwen-2.5-7B-Instruct/configs.yaml
@@ -0,0 +1,18 @@
FuseChat-Qwen-2.5-7B-Instruct:
  completions_kwargs:
    model_name: FuseAI/FuseChat-Qwen-2.5-7B-Instruct
    model_kwargs:
      dtype: bfloat16
    max_new_tokens: 4096
    temperature: 0.7
    top_p: 0.8
    repetition_penalty: 1.05
    top_k: 20
    batch_size: 1000
    stop_token_ids:
    - 151645
    - 151643
  fn_completions: vllm_local_completions
  pretty_name: FuseChat-Qwen-2.5-7B-Instruct
  prompt_template: FuseChat-Qwen-2.5-7B-Instruct/prompt.txt
  link: https://huggingface.co/FuseAI/FuseChat-Qwen-2.5-7B-Instruct
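Reviewer note: each config lists `stop_token_ids`, the token ids at which generation should end. The sketch below shows the intended effect on a sequence of generated token ids. It is a plain-Python illustration and not how the inference backend implements stopping; `truncate_at_stop` is a hypothetical helper and the example token ids other than the two stop ids are made up.

```python
from collections.abc import Iterable, Sequence

# Stop ids copied from the Qwen config in this commit.
QWEN_STOP_TOKEN_IDS = [151645, 151643]


def truncate_at_stop(token_ids: Sequence[int], stop_token_ids: Iterable[int]) -> list[int]:
    """Return the tokens generated before the first stop token (the stop token is dropped)."""
    stops = set(stop_token_ids)
    kept = []
    for token_id in token_ids:
        if token_id in stops:
            break
        kept.append(token_id)
    return kept


if __name__ == "__main__":
    # Made-up ids 10, 20, 30 stand in for ordinary text tokens.
    print(truncate_at_stop([10, 20, 151645, 30], QWEN_STOP_TOKEN_IDS))
```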
5 changes: 5 additions & 0 deletions
src/alpaca_eval/models_configs/FuseChat-Qwen-2.5-7B-Instruct/prompt.txt
@@ -0,0 +1,5 @@
<|im_start|>system
You are Qwen, created by Alibaba Cloud. You are a helpful assistant.<|im_end|>
<|im_start|>user
{instruction}<|im_end|>
<|im_start|>assistant