
🐯 Fix LigerKernel for SFTTrainer #2940

Merged
merged 3 commits into main on Feb 24, 2025

Conversation

lewtun (Member) commented Feb 23, 2025

What does this PR do?

Fixes an issue where use_liger=True raised an error during loss computation because the fix from #2874 was partially reverted by a line in #2890. Without this change, the loss computation fails with:

[rank0]:     shift_logits = outputs.logits[..., :-1, :].contiguous()
[rank0]:                    ~~~~~~~~~~~~~~^^^^^^^^^^^^^
[rank0]: TypeError: 'NoneType' object is not subscriptable

Command to test:

trl sft --model_name_or_path Qwen/Qwen2.5-0.5B --dataset_name trl-lib/Capybara --output_dir Qwen2.5-0.5B-SFT --use_liger true
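
For reference, a rough Python-API equivalent of the command above (a minimal sketch, not taken from the PR; it assumes the SFTConfig use_liger flag discussed here and the trl-lib/Capybara train split):

from datasets import load_dataset
from trl import SFTConfig, SFTTrainer

# Reproduce the CLI command programmatically: load the dataset, enable the
# Liger path via the config flag, and pass the model as a string path so the
# trainer builds it internally.
dataset = load_dataset("trl-lib/Capybara", split="train")
training_args = SFTConfig(output_dir="Qwen2.5-0.5B-SFT", use_liger=True)
trainer = SFTTrainer(model="Qwen/Qwen2.5-0.5B", args=training_args, train_dataset=dataset)
trainer.train()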

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline,
    Pull Request section?
  • Was this discussed/approved via a GitHub issue? Please add a link
    to it if that's the case.
  • Did you make sure to update the documentation with your changes? Here are the
    documentation guidelines.
  • Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

lewtun requested review from kashif and qgallouedec on February 23, 2025 at 19:21
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@@ -173,7 +173,6 @@ def __init__(
)
if isinstance(model, str):
model = self._create_model_from_path(model, args)
self.use_liger = is_liger_kernel_available() and isinstance(model, AutoLigerKernelForCausalLM)
Member
What if the model used is already a liger model (and args.use_liger = False)?

Member Author
Good point, but as recommended by the Liger maintainer, we shouldn't be passing Liger models at init (they should just be patched via the config).

Should we deprecate passing the Liger model to the trainer, or would you prefer an alternative?

Member
It seems that this command doesn't achieve what it's supposed to:

>>> from liger_kernel.transformers import AutoLigerKernelForCausalLM
>>> model =  AutoLigerKernelForCausalLM.from_pretrained("Qwen/Qwen2.5-0.5B")
Applied Liger kernels to Qwen2
Sliding Window Attention is enabled but not implemented for `sdpa`; unexpected results may be encountered.
>>> isinstance(model, AutoLigerKernelForCausalLM)
False

I don't know of an easy way to test whether a model is liger (perhaps @ByronHsu does?).

Anyway, with your change you can still pass a Liger model to the trainer, but you'll need to specify use_liger=True, which sounds good to me.

Contributor
This line doesn't work when a Liger model is converted to PEFT before being passed into the trainer. It doesn't respect args.use_liger either.

Contributor
> Good point, but as recommended by the Liger maintainer, we shouldn't be passing Liger models at init (they should just be patched via the config).
>
> Should we deprecate passing the Liger model to the trainer, or would you prefer an alternative?

Deprecating it might cause some problems. LoRA+ requires the model instance to be created beforehand (to create the optimizer), and the use_liger flag does not convert a PEFT-wrapped model into a Liger model:

import torch
from peft import get_peft_model
from peft.optimizers import create_loraplus_optimizer  # available in recent PEFT releases

model = get_peft_model(model, lora_config)
optimizer = create_loraplus_optimizer(
    model=model,
    optimizer_cls=torch.optim.AdamW,
    lr=lr,
    eps=eps,
    betas=betas,
    weight_decay=weight_decay,
    loraplus_lr_ratio=loraplus_lr_ratio,
)

Contributor
> The use_liger flag does not convert a PEFT-wrapped model into a Liger model.

In sft_trainer.py:

        if args.use_liger:
            if not is_liger_kernel_available():
                raise ImportError("Please install Liger-kernel for use_liger=True")
            model = AutoLigerKernelForCausalLM.from_pretrained(model_path, **model_init_kwargs)
        else:
            model = AutoModelForCausalLM.from_pretrained(model_path, **model_init_kwargs)
        return model
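
To illustrate the point (a hedged sketch, not code from the PR; the config values and dataset are placeholders), a model that is already instantiated and PEFT-wrapped never goes through _create_model_from_path, so use_liger=True does not patch it:

from datasets import load_dataset
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM
from trl import SFTConfig, SFTTrainer

base = AutoModelForCausalLM.from_pretrained("Qwen/Qwen2.5-0.5B")
peft_model = get_peft_model(base, LoraConfig(task_type="CAUSAL_LM"))
dataset = load_dataset("trl-lib/Capybara", split="train")

# Because a model instance (not a string path) is passed, the branch above is
# skipped entirely and the Liger kernels are never applied to this model.
trainer = SFTTrainer(
    model=peft_model,
    args=SFTConfig(output_dir="tmp-sft", use_liger=True),
    train_dataset=dataset,
)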

Contributor
Also, many VLMs can't be loaded with AutoLigerKernelForCausalLM. For example, monkey-patching with apply_liger_kernel_to_qwen2_vl() is required for Qwen2-VL.
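
For illustration, a minimal sketch of that monkey-patching route (it assumes apply_liger_kernel_to_qwen2_vl is exported by liger_kernel.transformers and must be called before the model is instantiated; the model id is just an example):

from liger_kernel.transformers import apply_liger_kernel_to_qwen2_vl
from transformers import Qwen2VLForConditionalGeneration

# Patch the Qwen2-VL modeling code in place, then load the model as usual.
apply_liger_kernel_to_qwen2_vl()
model = Qwen2VLForConditionalGeneration.from_pretrained("Qwen/Qwen2-VL-2B-Instruct")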

edbeeching (Collaborator)

@kashif @qgallouedec @lewtun ,

I think Liger models are now supported natively in transformers if the use_liger_kernel=True flag is set; perhaps we can drop the support for this in the SFTTrainer and use the native transformers implementation?

kashif (Collaborator) commented Feb 24, 2025

I think so too... we will need to pin the transformers version, but yes, it should be a better solution.

qgallouedec (Member) commented Feb 24, 2025

Thanks @edbeeching!

> I think so too... we will need to pin the transformers version, but yes, it should be a better solution.

After checking, it seems like use_liger_kernel exists for at least 4.46, which is the minimum version in TRL, so we shouldn't need to bump transformers:

https://github.com/huggingface/transformers/blob/052e652d6d53c2b26ffde87e039b723949a53493/src/transformers/training_args.py#L1521
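
For illustration, a hedged sketch of what the native route could look like (assuming SFTConfig inherits use_liger_kernel from transformers.TrainingArguments and does not shadow it):

from trl import SFTConfig

# use_liger_kernel is handled by the transformers Trainer itself (since v4.45),
# so no TRL-specific Liger handling would be needed in SFTTrainer.
training_args = SFTConfig(output_dir="Qwen2.5-0.5B-SFT", use_liger_kernel=True)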

qgallouedec (Member) commented Feb 24, 2025

It was introduced in 4.45.

lewtun (Member Author) commented Feb 24, 2025

Thanks for the pointer to transformers! Just to confirm, the current PR is fine to merge since the model init is taken care of in this line? https://github.com/huggingface/trl/pull/2940/files#r1967113819

If yes, feel free to merge if I'm offline :)

qgallouedec changed the title from "Fix LigerKernel for SFTTrainer" to "🐯 Fix LigerKernel for SFTTrainer" on Feb 24, 2025
qgallouedec merged commit 5c05913 into main on Feb 24, 2025
14 checks passed
qgallouedec deleted the fix-liger-sft branch on February 24, 2025 at 16:29
qgallouedec added a commit that referenced this pull request Feb 25, 2025
Co-authored-by: Quentin Gallouédec <45557362+qgallouedec@users.noreply.github.com>
jhinpan pushed a commit to jhinpan/trl-jin that referenced this pull request Mar 12, 2025
Co-authored-by: Quentin Gallouédec <45557362+qgallouedec@users.noreply.github.com>