-
Notifications
You must be signed in to change notification settings - Fork 207
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add WOQ int8 test with Inductor Freeze #362
Add WOQ int8 test with Inductor Freeze #362
Conversation
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/362
Note: Links to docs will display an error until the docs builds have been completed. ✅ No FailuresAs of commit 34f38ca with merge base bc2f8b7 (): This comment was automatically generated by Dr. CI and updates every 15 minutes. |
Hi @jerryzh168, I found current TorchAO testing are all without turning on of |
sure, what is the effect of that? is this required to get speedup for cpu? |
is this issue related? pytorch/pytorch#122813 |
I believe enabling freezing will make it easier for us to implement the first optimization mentioned by @jgong5 in pytorch/pytorch#122813 (comment). However, further work is needed to optimize the int4 WOQ kernel.
|
The motivation to add this UT is:
Looks like the new added UT |
are you referring to this error: `RuntimeError: Expected a proper Tensor but got None (or an undefined Tensor in C++) for argument #1 'mat2' While executing %mm : [num_users=1] = call_function[target=torch.ops.aten.mm.default](args = (%view, %convert_element_type), kwargs = {})` this seems like an implementation problem in AffineQuantizedTensor, but you should be able to repro I feel, since it failed in all torch versions (2.2.2, 2.3 and nightly) |
The failure with torch nightly is
which is different as 2.2 or 2.3
|
fac758a
to
34f38ca
Compare
Hi @jerryzh168, rebase to fix the nightly failure and skip the test with pytorch before 2.4. Could you help to approve for CI running again? BTW: Why |
this requires a fix: pytorch/pytorch#124888 that is only available in 2.4+ |
Looks all the UT are green now. |
Co-authored-by: Martin Yuan <myuan@meta.com>
Summary
Add the WOQ int8 test with inductor freeze
Test Plan