Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add support for groupwise quantization for int8 weight only quantization #1121

Merged
merged 1 commit into from
Oct 19, 2024

Conversation

jerryzh168
Copy link
Contributor

Summary:
This is to support deprecating torchchat int8 weight only quantization: https://github.com/pytorch/torchchat/blob/ecc628da7c32c486742d92a751ed045b2a2194be/torchchat/utils/quantize.py#L582

Test Plan:
python test/integration/test_integration.py -k test_weight_only_groupwise_quant

Reviewers:

Subscribers:

Tasks:

Tags:

Summary:
This is to support deprecating torchchat int8 weight only quantization: https://github.com/pytorch/torchchat/blob/ecc628da7c32c486742d92a751ed045b2a2194be/torchchat/utils/quantize.py#L582

Test Plan:
python test/integration/test_integration.py -k test_weight_only_groupwise_quant

Reviewers:

Subscribers:

Tasks:

Tags:
Copy link

pytorch-bot bot commented Oct 18, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/1121

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit cf7eafa with merge base 3475aed (image):
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Oct 18, 2024
@jerryzh168 jerryzh168 merged commit bc2aaaf into pytorch:main Oct 19, 2024
17 checks passed
@jerryzh168 jerryzh168 deleted the int8wo-groupwise branch October 19, 2024 01:09
yanbing-j pushed a commit to yanbing-j/ao that referenced this pull request Dec 9, 2024
… loading per stage and future perf measurements (pytorch#1121)

* add TrackTime, monitor perf for weight loading per stage

* add CUDATrackTime

* ruff formatting

* add device for CUDATrackTime per PR feedback

* add comment re: cuda context, ruff format
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants