Add hardware check to fp8 quant #1314

jainapurva · 2024-11-19T21:57:34Z

Add hardware check to ensure fp8 quantization only attempts runs on compatible hardware.

Test Plan: Ran float8_dynamic_quant on A100, MI300X, H100

Issue: #1188

pytorch-bot · 2024-11-19T21:57:37Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/1314

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 9624574 with merge base 8b1b168 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

torchao/quantization/quant_api.py

drisspg · 2024-11-19T22:50:08Z

torchao/quantization/quant_api.py

@@ -939,6 +940,9 @@ def float8_dynamic_activation_float8_weight(
        mm_config (Float8MMConfig): Configuration for the matrix multiplication. Default uses fast accumulation.

    """
+    assert (
+        is_cuda_8_9
+    ), "Float8 dynamic activation quantization is only supported on CUDA 8.9 and above"


This should also be supported on AMD. We should probably update this check.

cc @jeffdaily

Summary: Test Plan: Tested on AMD Instinct MI300X Reviewers: Subscribers: Tasks: Tags:

drisspg · 2024-11-22T21:30:04Z

torchao/quantization/quant_api.py

@@ -939,6 +941,9 @@ def float8_dynamic_activation_float8_weight(
        mm_config (Float8MMConfig): Configuration for the matrix multiplication. Default uses fast accumulation.

    """
+    assert (
+        is_cuda_8_9 or is_MI300()


if granularity is PerTensor then it is sm89 if it is PerRow then it is currenlty sm90 or higher

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Nov 19, 2024

jainapurva requested a review from drisspg November 19, 2024 21:57

jainapurva added the topic: bug fix Use this tag for PRs that fix bugs label Nov 19, 2024

Add hardware check to fp8 quant

88b6ba1

jainapurva force-pushed the fp8_check branch from 90b6c38 to 88b6ba1 Compare November 19, 2024 22:02

jerryzh168 reviewed Nov 19, 2024

View reviewed changes

torchao/quantization/quant_api.py Outdated Show resolved Hide resolved

drisspg reviewed Nov 19, 2024

View reviewed changes

MI300 check

2423c1d

Summary: Test Plan: Tested on AMD Instinct MI300X Reviewers: Subscribers: Tasks: Tags:

jainapurva force-pushed the fp8_check branch from 04e3529 to 2423c1d Compare November 22, 2024 01:00

jainapurva marked this pull request as ready for review November 22, 2024 20:05

Test fixes

b5945d3

drisspg approved these changes Nov 22, 2024

View reviewed changes

drisspg reviewed Nov 22, 2024

View reviewed changes

jainapurva added 2 commits November 25, 2024 11:05

Granularoty validation

436d3aa

Merge remote-tracking branch 'origin/main' into fp8_check

dfe2eb7

jainapurva linked an issue Nov 25, 2024 that may be closed by this pull request

[FLOAT8] Add Hardware Compatibility Check for FP8 Quantization #1188

Closed

Merge remote-tracking branch 'origin/main' into fp8_check

9624574

jainapurva merged commit 478d15b into main Nov 26, 2024
18 checks passed

yanbing-j pushed a commit to yanbing-j/ao that referenced this pull request Dec 9, 2024

remove obsolete param for using symlinks (pytorch#1314)

8c75754

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add hardware check to fp8 quant #1314

Add hardware check to fp8 quant #1314

jainapurva commented Nov 19, 2024 •

edited

Loading

pytorch-bot bot commented Nov 19, 2024 •

edited

Loading

drisspg Nov 19, 2024

drisspg Nov 22, 2024

Add hardware check to fp8 quant #1314

Add hardware check to fp8 quant #1314

Conversation

jainapurva commented Nov 19, 2024 • edited Loading

pytorch-bot bot commented Nov 19, 2024 • edited Loading

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/1314

✅ No Failures

drisspg Nov 19, 2024

Choose a reason for hiding this comment

drisspg Nov 22, 2024

Choose a reason for hiding this comment

jainapurva commented Nov 19, 2024 •

edited

Loading

pytorch-bot bot commented Nov 19, 2024 •

edited

Loading