Add `bos_token` and `add_generation_prompt` to the alpaca chat template #2322

minpeter · 2025-02-09T07:12:09Z

Description

Fixes the alpaca chat template used by the axolotl inference feature.

AS-IS

### Instruction: Describe the structure of an atom.

<Completion starts here>

TO-BE

<bos_token>### Instruction:
Describe the structure of an atom.

### Response:
<Completion starts here>

Motivation and Context

How has this been tested?

Screenshots (if appropriate)

Types of changes

Social Handles (Optional)

discord: minpeter

NanoCode012

Thanks for catching that!

xzuyn · 2025-02-10T16:27:22Z

That template would still have issues. It can't handle system turns, and also leaves \n\n after the last turn.

<bos_token>### Instruction:
<user turn>

### Response:
<assistant turn><eos_token>

### Instruction:
<user turn>

### Response:
<assistant turn><eos_token>

This template would fix those. Due to the way the original alpaca format handles system prompts, I've limited it to only having a system prompt on the first turn.

{{ bos_token }}{% for message in messages %}{% if message['role'] == 'system' and loop.first %}{{ message['content'] }}{% elif message['role'] == 'user' %}{{ '### Instruction:\n' + message['content'] }}{% elif message['role'] == 'assistant' %}{{ '### Response:\n' + message['content'] + eos_token }}{% endif %}{% if not loop.last %}{{ '\n\n' }}{% endif %}{% endfor %}{% if add_generation_prompt %}{{ '\n\n### Response:\n' }}{% endif %}

<bos_token><optional system turn>

### Instruction:
<user turn>

### Response:
<assistant turn><eos_token>

### Instruction:
<user turn>

### Response:
<assistant turn><eos_token>
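
To make the intended behavior easier to check, here is a minimal plain-Python sketch that mirrors the logic of the Jinja template above. It is not axolotl code: `render_alpaca` is a hypothetical helper, and the `<s>` / `</s>` defaults are assumed placeholder tokens (the real values come from the tokenizer).

```python
def render_alpaca(messages, bos_token="<s>", eos_token="</s>",
                  add_generation_prompt=False):
    """Plain-Python mirror of the proposed template (illustrative only).

    `bos_token` / `eos_token` defaults are assumed placeholders.
    """
    parts = [bos_token]
    last_index = len(messages) - 1
    for i, message in enumerate(messages):
        role, content = message["role"], message["content"]
        if role == "system" and i == 0:
            # A system prompt is only emitted when it is the first message.
            parts.append(content)
        elif role == "user":
            parts.append("### Instruction:\n" + content)
        elif role == "assistant":
            parts.append("### Response:\n" + content + eos_token)
        # Separator goes between turns, never after the final one.
        if i != last_index:
            parts.append("\n\n")
    if add_generation_prompt:
        # Open an assistant turn so the model starts its completion here.
        parts.append("\n\n### Response:\n")
    return "".join(parts)


if __name__ == "__main__":
    prompt = render_alpaca(
        [{"role": "user", "content": "Describe the structure of an atom."}],
        add_generation_prompt=True,
    )
    print(repr(prompt))
```

With a single user message and `add_generation_prompt=True`, this yields the TO-BE layout from the PR description; with a full system/user/assistant conversation and no generation prompt, the output ends at `eos_token` with no trailing `\n\n`.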

Co-authored-by: xzuyn <xzuyn@users.noreply.github.com>

minpeter · 2025-02-10T17:32:06Z

Please review the changes and proceed with merging. cc @NanoCode012

NanoCode012 · 2025-02-11T02:12:08Z

Since we're also adding an eos_token to separate turns (among other changes), let me think this through, as we're modifying a common template.

fix alpaca add_generation_prompt

9e80efd

minpeter marked this pull request as draft February 9, 2025 07:12

Merge branch 'main' into fix-alpaca-chat-tmpl

9c4994b

minpeter changed the title ~~fix alpaca add_generation_prompt~~ Add bos_token and add_generation_prompt to the alpaca chat template Feb 9, 2025

minpeter changed the title ~~Add bos_token and add_generation_prompt to the alpaca chat template~~ Add eos_token and add_generation_prompt to the alpaca chat template Feb 9, 2025

minpeter changed the title ~~Add eos_token and add_generation_prompt to the alpaca chat template~~ Add bos_token and add_generation_prompt to the alpaca chat template Feb 9, 2025

minpeter marked this pull request as ready for review February 9, 2025 07:21

minpeter changed the title ~~Add bos_token and add_generation_prompt to the alpaca chat template~~ Add bos_token and add_generation_prompt to the alpaca chat template Feb 9, 2025

NanoCode012 approved these changes Feb 10, 2025


Alpaca template considering multi-turn

8edff5f

Co-authored-by: xzuyn <xzuyn@users.noreply.github.com>

minpeter requested a review from NanoCode012 February 10, 2025 17:31
