Skip to content
This repository has been archived by the owner on Oct 25, 2024. It is now read-only.

Promote qbits as itrex module #1399

Merged
merged 11 commits into from
Mar 21, 2024
Merged

Promote qbits as itrex module #1399

merged 11 commits into from
Mar 21, 2024

Conversation

zhewang1-intc
Copy link
Contributor

Type of Change

feature or bug fix or documentation or others
API changed or not

Description

detail description
JIRA ticket: xxx

Expected Behavior & Potential Risk

the expected behavior that triggered by this PR

How has this PR been tested?

how to reproduce the test (including hardware information)

Dependency Change?

any library dependency introduced or removed

Copy link

github-actions bot commented Mar 20, 2024

⚡ Required checks status: All passing 🟢

Groups summary

🟢 Format Scan Tests workflow
Check ID Status Error details
format-scan (pylint) success
format-scan (bandit) success
format-scan (cloc) success
format-scan (cpplint) success

These checks are required after the changes to intel_extension_for_transformers/transformers/llm/operator/csrc/CMakeLists.txt, intel_extension_for_transformers/transformers/llm/operator/csrc/dispatcher/CMakeLists.txt, intel_extension_for_transformers/transformers/llm/operator/csrc/dispatcher/include/bestla_packq_impl.hpp, intel_extension_for_transformers/transformers/llm/operator/csrc/dispatcher/include/bestla_weightonly_dispatcher.hpp, intel_extension_for_transformers/transformers/llm/operator/csrc/dispatcher/src/bestla_packq_impl.cpp, intel_extension_for_transformers/transformers/llm/operator/csrc/dispatcher/src/bestla_weightonly_dispatcher.cpp, intel_extension_for_transformers/transformers/llm/operator/csrc/qbits.cpp, intel_extension_for_transformers/transformers/llm/operator/csrc/qbits_ut/test_dropout.py, intel_extension_for_transformers/transformers/llm/operator/csrc/qbits_ut/test_matmul.py, intel_extension_for_transformers/transformers/llm/operator/csrc/qbits_ut/test_packq.py, intel_extension_for_transformers/transformers/llm/operator/csrc/qbits_ut/test_weightonly.py, intel_extension_for_transformers/transformers/llm/operator/csrc/qbits_ut/ut_utils.py, intel_extension_for_transformers/transformers/llm/quantization/autograd/functions.py, intel_extension_for_transformers/transformers/llm/quantization/nn/modules.py, setup.py.

🟢 Optimize Unit Test workflow
Check ID Status Error details
optimize-unit-test-baseline success
optimize-unit-test-PR-test success
Genreate-OptimizeUT-Report success

These checks are required after the changes to intel_extension_for_transformers/transformers/llm/operator/csrc/CMakeLists.txt, intel_extension_for_transformers/transformers/llm/operator/csrc/dispatcher/CMakeLists.txt, intel_extension_for_transformers/transformers/llm/operator/csrc/dispatcher/include/bestla_packq_impl.hpp, intel_extension_for_transformers/transformers/llm/operator/csrc/dispatcher/include/bestla_weightonly_dispatcher.hpp, intel_extension_for_transformers/transformers/llm/operator/csrc/dispatcher/src/bestla_packq_impl.cpp, intel_extension_for_transformers/transformers/llm/operator/csrc/dispatcher/src/bestla_weightonly_dispatcher.cpp, intel_extension_for_transformers/transformers/llm/operator/csrc/qbits.cpp, intel_extension_for_transformers/transformers/llm/operator/csrc/qbits_ut/test_dropout.py, intel_extension_for_transformers/transformers/llm/operator/csrc/qbits_ut/test_matmul.py, intel_extension_for_transformers/transformers/llm/operator/csrc/qbits_ut/test_packq.py, intel_extension_for_transformers/transformers/llm/operator/csrc/qbits_ut/test_weightonly.py, intel_extension_for_transformers/transformers/llm/operator/csrc/qbits_ut/ut_utils.py, intel_extension_for_transformers/transformers/llm/quantization/autograd/functions.py, intel_extension_for_transformers/transformers/llm/quantization/nn/modules.py, setup.py, tests/CI/test_weight_only.py.

🟢 NeuralChat Unit Test
Check ID Status Error details
neuralchat-unit-test-baseline success
neuralchat-unit-test-PR-test success
Generate-NeuralChat-Report success

These checks are required after the changes to setup.py, intel_extension_for_transformers/transformers/llm/quantization/autograd/functions.py, intel_extension_for_transformers/transformers/llm/quantization/nn/modules.py, intel_extension_for_transformers/transformers/llm/operator/csrc/CMakeLists.txt, intel_extension_for_transformers/transformers/llm/operator/csrc/dispatcher/CMakeLists.txt, intel_extension_for_transformers/transformers/llm/operator/csrc/dispatcher/include/bestla_packq_impl.hpp, intel_extension_for_transformers/transformers/llm/operator/csrc/dispatcher/include/bestla_weightonly_dispatcher.hpp, intel_extension_for_transformers/transformers/llm/operator/csrc/dispatcher/src/bestla_packq_impl.cpp, intel_extension_for_transformers/transformers/llm/operator/csrc/dispatcher/src/bestla_weightonly_dispatcher.cpp, intel_extension_for_transformers/transformers/llm/operator/csrc/qbits.cpp, intel_extension_for_transformers/transformers/llm/operator/csrc/qbits_ut/test_dropout.py, intel_extension_for_transformers/transformers/llm/operator/csrc/qbits_ut/test_matmul.py, intel_extension_for_transformers/transformers/llm/operator/csrc/qbits_ut/test_packq.py, intel_extension_for_transformers/transformers/llm/operator/csrc/qbits_ut/test_weightonly.py, intel_extension_for_transformers/transformers/llm/operator/csrc/qbits_ut/ut_utils.py.

🟢 Engine Unit Test workflow
Check ID Status Error details
engine-unit-test-baseline success
engine-unit-test-PR-test success
Genreate-Engine-Report success

These checks are required after the changes to setup.py, intel_extension_for_transformers/transformers/llm/operator/csrc/CMakeLists.txt, intel_extension_for_transformers/transformers/llm/operator/csrc/dispatcher/CMakeLists.txt, intel_extension_for_transformers/transformers/llm/operator/csrc/dispatcher/include/bestla_packq_impl.hpp, intel_extension_for_transformers/transformers/llm/operator/csrc/dispatcher/include/bestla_weightonly_dispatcher.hpp, intel_extension_for_transformers/transformers/llm/operator/csrc/dispatcher/src/bestla_packq_impl.cpp, intel_extension_for_transformers/transformers/llm/operator/csrc/dispatcher/src/bestla_weightonly_dispatcher.cpp, intel_extension_for_transformers/transformers/llm/operator/csrc/qbits.cpp, intel_extension_for_transformers/transformers/llm/operator/csrc/qbits_ut/test_dropout.py, intel_extension_for_transformers/transformers/llm/operator/csrc/qbits_ut/test_matmul.py, intel_extension_for_transformers/transformers/llm/operator/csrc/qbits_ut/test_packq.py, intel_extension_for_transformers/transformers/llm/operator/csrc/qbits_ut/test_weightonly.py, intel_extension_for_transformers/transformers/llm/operator/csrc/qbits_ut/ut_utils.py, intel_extension_for_transformers/transformers/llm/quantization/autograd/functions.py, intel_extension_for_transformers/transformers/llm/quantization/nn/modules.py.

🟢 Windows Binary Test
Check ID Status Error details
Windows-Binary-Test success

These checks are required after the changes to setup.py, intel_extension_for_transformers/transformers/llm/operator/csrc/CMakeLists.txt, intel_extension_for_transformers/transformers/llm/operator/csrc/dispatcher/CMakeLists.txt, intel_extension_for_transformers/transformers/llm/operator/csrc/dispatcher/include/bestla_packq_impl.hpp, intel_extension_for_transformers/transformers/llm/operator/csrc/dispatcher/include/bestla_weightonly_dispatcher.hpp, intel_extension_for_transformers/transformers/llm/operator/csrc/dispatcher/src/bestla_packq_impl.cpp, intel_extension_for_transformers/transformers/llm/operator/csrc/dispatcher/src/bestla_weightonly_dispatcher.cpp, intel_extension_for_transformers/transformers/llm/operator/csrc/qbits.cpp, intel_extension_for_transformers/transformers/llm/operator/csrc/qbits_ut/test_dropout.py, intel_extension_for_transformers/transformers/llm/operator/csrc/qbits_ut/test_matmul.py, intel_extension_for_transformers/transformers/llm/operator/csrc/qbits_ut/test_packq.py, intel_extension_for_transformers/transformers/llm/operator/csrc/qbits_ut/test_weightonly.py, intel_extension_for_transformers/transformers/llm/operator/csrc/qbits_ut/ut_utils.py.

🟢 Chat Bot Test workflow
Check ID Status Error details
call-inference-llama-2-7b-chat-hf / inference test success
call-inference-mpt-7b-chat / inference test success

These checks are required after the changes to intel_extension_for_transformers/transformers/llm/quantization/autograd/functions.py, intel_extension_for_transformers/transformers/llm/quantization/nn/modules.py, intel_extension_for_transformers/transformers/llm/operator/csrc/CMakeLists.txt, intel_extension_for_transformers/transformers/llm/operator/csrc/dispatcher/CMakeLists.txt, intel_extension_for_transformers/transformers/llm/operator/csrc/dispatcher/include/bestla_packq_impl.hpp, intel_extension_for_transformers/transformers/llm/operator/csrc/dispatcher/include/bestla_weightonly_dispatcher.hpp, intel_extension_for_transformers/transformers/llm/operator/csrc/dispatcher/src/bestla_packq_impl.cpp, intel_extension_for_transformers/transformers/llm/operator/csrc/dispatcher/src/bestla_weightonly_dispatcher.cpp, intel_extension_for_transformers/transformers/llm/operator/csrc/qbits.cpp, intel_extension_for_transformers/transformers/llm/operator/csrc/qbits_ut/test_dropout.py, intel_extension_for_transformers/transformers/llm/operator/csrc/qbits_ut/test_matmul.py, intel_extension_for_transformers/transformers/llm/operator/csrc/qbits_ut/test_packq.py, intel_extension_for_transformers/transformers/llm/operator/csrc/qbits_ut/test_weightonly.py, intel_extension_for_transformers/transformers/llm/operator/csrc/qbits_ut/ut_utils.py.


Thank you for your contribution! 💜

Note
This comment is automatically generated and updates for 360 minutes every 180 seconds. If you have any other questions, contact VincyZhang or XuehaoSun for help.

@a32543254
Copy link
Contributor

how about we add a doc to intro this new api to costumers?

@zhewang1-intc
Copy link
Contributor Author

how about we add a doc to intro this new api to costumers?

sure

@zhewang1-intc zhewang1-intc force-pushed the promote_qbits_as_itrex_module branch from 17b874b to 3826e7c Compare March 21, 2024 01:45
@VincyZhang
Copy link
Contributor

pylint is fixed in this PR: #1403

Copy link
Contributor

@a32543254 a32543254 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@VincyZhang VincyZhang merged commit 468d7cf into main Mar 21, 2024
19 checks passed
@VincyZhang VincyZhang deleted the promote_qbits_as_itrex_module branch March 21, 2024 09:25
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

7 participants