Updated Trainer's liger-kernel integration to call correct patching API #33502

shimizust · 2024-09-16T00:52:05Z

What does this PR do?

The Trainer previously added a use_liger_kernel flag to apply liger kernels to the given model based on model type (Integrate Liger (Linkedin GPU Efficient Runtime) Kernel to Trainer #32860). However, we realized that calling patching APIs after the model has been initialized will only partially patch with Liger kernels (see https://github.com/linkedin/Liger-Kernel/pull/199/files for more details).
Updated the Trainer code to invoke the new patching API that correctly patches model instance variables post-model initialization.
Addresses Integrate Liger (Linkedin GPU Efficient Runtime) Kernel to HuggingFace #32861

Testing

pytest tests/trainer/test_trainer.py::TrainerIntegrationTest::test_use_liger_kernel_patching tests/trainer/test_trainer.py::TrainerIntegrationTest::test_use_liger_kernel_trainer
================================================================================================================================================ test session starts ================================================================================================================================================
platform linux -- Python 3.10.14, pytest-7.4.4, pluggy-1.0.0
rootdir: /home/jobuser/transformers
configfile: pyproject.toml
plugins: xdist-3.6.1, timeout-2.3.1, rich-0.1.1, lipy-config-base-32.0.38, lipy-fabric-35.3.13, lipy-test-8.0.73, datadir-1.3.1
collected 2 items                                                                                                                                                                                                                           

tests/trainer/test_trainer.py ..                                                                                                                                                                                                      [100%]

============================================================================================================= warnings summary ==============================================================================================================
tests/trainer/test_trainer.py::TrainerIntegrationTest::test_use_liger_kernel_trainer
  /home/jobuser/transformers/.venv/lib/python3.10/site-packages/torch/nn/parallel/_functions.py:68: UserWarning: Was asked to gather along dimension 0, but all input tensors were scalars; will instead unsqueeze and return a vector.
    warnings.warn('Was asked to gather along dimension 0, but all '

-- Docs: https://docs.pytest.org/en/stable/how-to/capture-warnings.html
======================================================================================================= 2 passed, 1 warning in 7.34s ========================================================================================================

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline,
Pull Request section?
Was this discussed/approved via a Github issue or the forum? Please add a link
to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

cc @SunMarc @ArthurZucker

shimizust · 2024-09-16T00:52:55Z

cc @ByronHsu @JasonZhu1313

JasonZhu1313 · 2024-09-16T17:47:48Z

LGTM thanks for the great work!

SunMarc

LGTM ! Thanks for updating the API !

ArthurZucker

Thanks for updating, in time for the next release that will include both

…PI (huggingface#33502) * Updated liger-kernel integration in Trainer to call correct patching API * Fixed styling

shimizust marked this pull request as ready for review September 16, 2024 00:52

shimizust added 2 commits September 16, 2024 17:42

Updated liger-kernel integration in Trainer to call correct patching API

166b2bd

Fixed styling

26ea9e2

shimizust force-pushed the sshimizu/liger-fix branch from 41842cb to 26ea9e2 Compare September 16, 2024 17:42

qingquansong approved these changes Sep 16, 2024

View reviewed changes

SunMarc approved these changes Sep 16, 2024

View reviewed changes

SunMarc requested a review from ArthurZucker September 16, 2024 23:56

ArthurZucker approved these changes Sep 17, 2024

View reviewed changes

ArthurZucker merged commit ba1f1dc into huggingface:main Sep 17, 2024
21 of 23 checks passed

ryankert01 mentioned this pull request Sep 18, 2024

Support Yi-Coder linkedin/Liger-Kernel#208

Closed

ambroser53 mentioned this pull request Oct 8, 2024

PeftModel is not an instance of PreTrainedModel. No liger kernels will be applied. #34016

Closed

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Updated Trainer's liger-kernel integration to call correct patching API #33502

Updated Trainer's liger-kernel integration to call correct patching API #33502

shimizust commented Sep 16, 2024 •

edited

Loading

shimizust commented Sep 16, 2024

JasonZhu1313 commented Sep 16, 2024

SunMarc left a comment

ArthurZucker left a comment

Updated Trainer's liger-kernel integration to call correct patching API #33502

Updated Trainer's liger-kernel integration to call correct patching API #33502

Conversation

shimizust commented Sep 16, 2024 • edited Loading

What does this PR do?

Testing

Before submitting

Who can review?

shimizust commented Sep 16, 2024

JasonZhu1313 commented Sep 16, 2024

SunMarc left a comment

Choose a reason for hiding this comment

ArthurZucker left a comment

Choose a reason for hiding this comment

shimizust commented Sep 16, 2024 •

edited

Loading