Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

8-bit packing support #1248

Merged
merged 1 commit into from
Nov 8, 2024
Merged

Conversation

tibidoh
Copy link
Contributor

@tibidoh tibidoh commented Nov 8, 2024

Summary:
Add support for 8-bit quantization for Llama in torchchat.

8-bit packing behaves exactly like 1-7 bit packing, but the implementation is slightly different: 8-bit chunks are used as-is without shifting.

Reviewed By: metascroy

Differential Revision: D65570988

Summary:
Add support for 8-bit quantization for Llama in torchchat. 

8-bit packing behaves exactly like 1-7 bit packing, but the implementation is slightly different: 8-bit chunks are used as-is without shifting.

Reviewed By: metascroy

Differential Revision: D65570988
Copy link

pytorch-bot bot commented Nov 8, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/1248

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure, 2 Unrelated Failures

As of commit 101e980 with merge base e41ca4e (image):

NEW FAILURE - The following job has failed:

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Nov 8, 2024
@facebook-github-bot
Copy link

This pull request was exported from Phabricator. Differential Revision: D65570988

@facebook-github-bot facebook-github-bot merged commit 653e4a9 into pytorch:main Nov 8, 2024
16 of 19 checks passed
jainapurva pushed a commit that referenced this pull request Nov 11, 2024
Differential Revision: D65570988

Pull Request resolved: #1248
jainapurva pushed a commit that referenced this pull request Nov 12, 2024
Differential Revision: D65570988

Pull Request resolved: #1248
sunjiweiswift pushed a commit to sunjiweiswift/ao that referenced this pull request Nov 25, 2024
Differential Revision: D65570988

Pull Request resolved: pytorch#1248
yanbing-j pushed a commit to yanbing-j/ao that referenced this pull request Dec 9, 2024
…ch#1248)

* Fix non-MM multiturn: Use legacy formatting

* Absorb non-MM OpenAI dialog parsing into generic input parsing

* Lint and docstrings
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. fb-exported
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants