Support fp8e5m2 dtype #7740

Merged
lsy323 merged 4 commits into master from lsiyuan/add-fp8e5m2 on Aug 6, 2024
Conversation

lsy323
Collaborator

@lsy323 lsy323 commented Jul 25, 2024

Add support for the torch.float8_e5m2 dtype.

Right now torch_xla/csrc/tensor_util.cpp contains a lot of duplicated code; in the next PR I'll refactor it to make it cleaner.

The other fp8 variants will be added in follow-up PRs.

Test:
Added an fp8 unit test.
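
For illustration, here is a minimal sketch (not the unit test added in this PR) of what exercising the new dtype might look like on an XLA device. The shapes and names are made up, and it assumes a torch_xla build that includes this change and a backend that can lower matmul with fp8 inputs.

```python
# Hedged sketch, not the PR's actual test: move float8_e5m2 tensors to an
# XLA device and run a matmul, assuming a torch_xla build with this change.
import torch
import torch_xla.core.xla_model as xm

device = xm.xla_device()

# float8_e5m2: 1 sign bit, 5 exponent bits, 2 mantissa bits.
x = torch.randn(8, 8).to(torch.float8_e5m2).to(device)
w = torch.randn(8, 8).to(torch.float8_e5m2).to(device)

out = torch.matmul(x, w)

# Upcast before moving back to CPU for inspection, since CPU op coverage
# for fp8 is limited.
print(out.to(torch.float32).cpu())
```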

@lsy323 lsy323 requested a review from miladm July 25, 2024 00:06
@lsy323 lsy323 assigned JackCaoG and lsy323 and unassigned JackCaoG Jul 25, 2024
@lsy323 lsy323 requested a review from JackCaoG July 25, 2024 00:08
@lsy323 lsy323 added the fp8 label Jul 25, 2024
@miladm
Collaborator

miladm commented Jul 25, 2024

  • Please include documentation for FP8, with examples for users. Feel free to enrich the document as you add more FP8 capabilities/variants.

  • How can we add a small micro-benchmark to the documentation to illustrate the benefits of FP8 vs. the baseline BF16? (A rough sketch follows after this list.)

  • To my knowledge, PT AMP does not support FP8; correct?
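
As a rough illustration of the micro-benchmark suggested above (not part of this PR or its documentation): the sketch below times a large matmul in BF16 versus float8_e5m2 on an XLA device. It assumes the backend lowers fp8 matmul; absolute numbers will depend heavily on hardware and shapes.

```python
# Rough micro-benchmark sketch (not from this PR): BF16 vs. float8_e5m2
# matmul latency on an XLA device. Assumes fp8 matmul is supported by the
# backend; results vary by hardware and problem size.
import time
import torch
import torch_xla.core.xla_model as xm

def time_matmul(dtype, n=4096, iters=10):
    device = xm.xla_device()
    a = torch.randn(n, n).to(dtype).to(device)
    b = torch.randn(n, n).to(dtype).to(device)
    xm.mark_step()
    xm.wait_device_ops()  # materialize inputs before starting the timer
    start = time.perf_counter()
    for _ in range(iters):
        c = torch.matmul(a, b)
        xm.mark_step()    # cut the graph so each iteration actually executes
    xm.wait_device_ops()  # wait for all queued executions to finish
    return (time.perf_counter() - start) / iters

print("bf16   :", time_matmul(torch.bfloat16))
print("fp8e5m2:", time_matmul(torch.float8_e5m2))
```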

@miladm
Collaborator

miladm commented Aug 1, 2024

@lsy323 we should include an accuracy correctness analysis in our benchmarking and documentation.

@qihqi qihqi requested review from miladm and JackCaoG August 5, 2024 17:12
Collaborator

@JackCaoG JackCaoG left a comment

Approving to unblock; mostly LGTM. We can have a follow-up PR to address the review comments.

@lsy323
Collaborator Author

lsy323 commented Aug 6, 2024

Hi @JackCaoG, @miladm, thank you for reviewing! Sorry for the delayed response to the comments.

we should include an accuracy correctness analysis in our benchmarking and documentation.

This PR includes a unit test to verify the correctness of the fp8 dtype and the matmul op. Let's add documentation in a separate PR (after the e4m3 variants are ready).

@lsy323 lsy323 merged commit 1ed2626 into master Aug 6, 2024
21 of 22 checks passed
@lsy323 lsy323 deleted the lsiyuan/add-fp8e5m2 branch August 6, 2024 17:35
@lsy323 lsy323 mentioned this pull request Aug 13, 2024