8-bit packing support #1248
Merged
Conversation
Summary: Add support for 8-bit quantization for Llama in torchchat. 8-bit packing behaves exactly like 1-7 bit packing, but the implementation differs slightly: 8-bit chunks are used as-is, without shifting.

Reviewed By: metascroy

Differential Revision: D65570988
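For context, the packing scheme the summary refers to can be sketched as follows. This is an illustrative Python/PyTorch sketch, not the torchao kernel (which is written in C++ and handles the full 1-7 bit range); it assumes unsigned values and a bit width that divides 8 evenly. Sub-byte values are shifted into position and OR'd together so several share one byte; in the 8-bit case each chunk already fills a byte, so it is stored as-is with no shifting.

```python
import torch

def pack(values: torch.Tensor, nbit: int) -> torch.Tensor:
    """Pack unsigned nbit integers (held in a uint8 tensor) into bytes.

    Illustrative only; assumes nbit in {1, 2, 4, 8} and values < 2**nbit.
    """
    if nbit == 8:
        # 8-bit chunks are used as-is: one value per byte, no shifting.
        return values.clone()
    per_byte = 8 // nbit
    grouped = values.reshape(-1, per_byte)  # numel must divide evenly
    packed = torch.zeros(grouped.shape[0], dtype=torch.uint8)
    for i in range(per_byte):
        # Shift each sub-byte value into its slot and OR it in.
        packed |= grouped[:, i] << (i * nbit)
    return packed

def unpack(packed: torch.Tensor, nbit: int) -> torch.Tensor:
    """Inverse of pack for the same restricted bit widths."""
    if nbit == 8:
        return packed.clone()
    per_byte = 8 // nbit
    mask = (1 << nbit) - 1
    cols = [(packed >> (i * nbit)) & mask for i in range(per_byte)]
    return torch.stack(cols, dim=1).reshape(-1)

vals = torch.randint(0, 16, (8,), dtype=torch.uint8)
assert torch.equal(unpack(pack(vals, 4), 4), vals)  # 8 values in 4 bytes
assert torch.equal(unpack(pack(vals, 8), 8), vals)  # 8-bit: plain copy
```

Because the 8-bit path degenerates to a straight copy, it can skip the shift/OR loop entirely, which is why the summary calls the implementation "slightly different" from the 1-7 bit case.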
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/1248
Note: Links to docs will display an error until the docs builds have been completed.
❌ 1 New Failure, 2 Unrelated Failures as of commit 101e980 with merge base e41ca4e:
NEW FAILURE - The following job has failed:
BROKEN TRUNK - The following jobs failed but were present on the merge base:
👉 Rebase onto the `viable/strict` branch to avoid these failures
This comment was automatically generated by Dr. CI and updates every 15 minutes.
facebook-github-bot added the CLA Signed label Nov 8, 2024
This pull request was exported from Phabricator. Differential Revision: D65570988
metascroy approved these changes Nov 8, 2024
jainapurva pushed a commit that referenced this pull request Nov 11, 2024
Differential Revision: D65570988 Pull Request resolved: #1248
jainapurva pushed a commit that referenced this pull request Nov 12, 2024
Differential Revision: D65570988 Pull Request resolved: #1248
sunjiweiswift pushed a commit to sunjiweiswift/ao that referenced this pull request Nov 25, 2024
Differential Revision: D65570988 Pull Request resolved: pytorch#1248
yanbing-j pushed a commit to yanbing-j/ao that referenced this pull request Dec 9, 2024
…ch#1248)
* Fix non-MM multiturn: Use legacy formatting
* Absorb non-MM OpenAI dialog parsing into generic input parsing
* Lint and docstrings
Labels
CLA Signed (this label is managed by the Facebook bot; authors need to sign the CLA before a PR can be reviewed)
fb-exported