PEFT fix: T5 prefix-tuning with new cache format #34312
What does this PR do?
Fixes prefix tuning for T5 models, which broke after the recent compile-compatibility PR. Note, however, that this will not enable correct prefix tuning unless huggingface/peft#2096 (review) is merged.
The main issue was that, in the new cache format, we don't concatenate new key/values with cached key/values if `past_key_values.is_updated` is set. That issue is fixed on the PEFT side by initializing a cache object and setting `is_updated=False`.
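For context, a rough sketch of what that PEFT-side initialization could look like (this is not the actual huggingface/peft#2096 implementation; it assumes transformers' `EncoderDecoderCache`, whose `is_updated` dict the T5 cross-attention consults, and an illustrative layer count):

```python
from transformers import DynamicCache, EncoderDecoderCache

# Wrap the (to-be-filled) prefix key/values in a cache object and mark
# every decoder layer as not yet updated, so the model still concatenates
# new key/values with the cached prefix instead of skipping the update.
past_key_values = EncoderDecoderCache(DynamicCache(), DynamicCache())

num_layers = 6  # illustrative; in practice taken from the model config
for layer_idx in range(num_layers):
    # the trained prefix states would be written into
    # past_key_values.self_attention_cache here via .update(...)
    past_key_values.is_updated[layer_idx] = False
```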
Another issue is a mask shape mismatch, since we have to extend `cross_attention_mask` to account for the new virtual tokens. The current PR enables this, but I am also wondering whether it would be possible to do it in PEFT instead and pass an already extended attention mask.
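Concretely, the extension amounts to padding the mask with ones for the virtual tokens along the key/value length; a minimal sketch (function and variable names are illustrative, not the PR's actual code):

```python
import torch

def extend_cross_attention_mask(mask: torch.Tensor, num_virtual_tokens: int) -> torch.Tensor:
    # mask: (batch_size, encoder_seq_len); virtual tokens are prepended to
    # the cross-attention keys/values, so the mask grows on that side too
    prefix_mask = mask.new_ones(mask.shape[0], num_virtual_tokens)
    return torch.cat([prefix_mask, mask], dim=-1)

# e.g. a (2, 16) encoder attention mask becomes (2, 36) with 20 virtual tokens
mask = torch.ones(2, 16, dtype=torch.long)
print(extend_cross_attention_mask(mask, num_virtual_tokens=20).shape)  # torch.Size([2, 36])
```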
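Tested with code along the lines of the below for T5 (the original snippet is not reproduced here; this is a minimal sketch using the standard PEFT prefix-tuning API, with a placeholder model id and prompt):

```python
import torch
from transformers import AutoModelForSeq2SeqLM, AutoTokenizer
from peft import PrefixTuningConfig, TaskType, get_peft_model

# Wrap a T5 model with prefix tuning and run generation, which exercises
# the cross-attention mask path touched by this PR.
model_id = "t5-small"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForSeq2SeqLM.from_pretrained(model_id)

peft_config = PrefixTuningConfig(task_type=TaskType.SEQ_2_SEQ_LM, num_virtual_tokens=20)
model = get_peft_model(model, peft_config)

inputs = tokenizer("translate English to German: The house is wonderful.", return_tensors="pt")
with torch.no_grad():
    out = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(out[0], skip_special_tokens=True))
```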
cc @BenjaminBossan wdyt?