assistant prefill #2615
Conversation
vllm (pretrained=meta-llama/Llama-3.1-8B-Instruct,enable_prefix_caching=True), gen_kwargs: (None), limit: None, num_fewshot: None, batch_size: auto
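The reported configuration corresponds to a harness run along these lines (a sketch reconstructed from the result header above; the flag spelling follows the standard lm-eval CLI and is not quoted from this thread):

```shell
# Evaluate Llama-3.1-8B-Instruct through the vLLM backend with prefix
# caching enabled, auto batch size, and the model's chat template applied.
lm_eval --model vllm \
  --model_args pretrained=meta-llama/Llama-3.1-8B-Instruct,enable_prefix_caching=True \
  --batch_size auto \
  --apply_chat_template
```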
There appears to be a discrepancy between our evaluation results and those reported in the Hugging Face eval repo for
add an assistant_prefill option to the chat prompt. This makes the assistant's responses include certain content at the start, after the <|assistant|> token
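The prefill mechanism described above can be sketched as follows. The message format and the assistant_prefill name follow the description in this thread; the template function itself is a hypothetical stand-in, not the harness's actual API:

```python
# Sketch: splice an assistant prefill into a rendered chat prompt so the
# model's completion continues from the prefill text instead of starting
# its turn from scratch.

def apply_chat_template(messages, assistant_prefill=""):
    """Render messages into a prompt string that ends with an open
    assistant turn seeded with the prefill content."""
    prompt = ""
    for m in messages:
        prompt += f"<|{m['role']}|>\n{m['content']}\n"
    # Open the assistant turn and place the prefill right after the
    # <|assistant|> token, as described above.
    prompt += f"<|assistant|>\n{assistant_prefill}"
    return prompt

messages = [{"role": "user", "content": "What is 2 + 2?"}]
print(apply_chat_template(messages, assistant_prefill="The answer is "))
```

Because the prompt ends mid-turn, the model generates a continuation of "The answer is " rather than opening a fresh assistant response.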
added arc_challenge_chat and mmlu_llama (both chat) from the llama evals