[TIR] Add CUDA int4 tensor core intrinsics #14598

vinx13 · 2023-04-11T23:35:52Z

This PR added int4 tensor intrinsic for CUDA tensor core.

tvm-bot · 2023-04-11T23:35:55Z

Thanks for contributing to TVM! Please refer to the contributing guidelines https://tvm.apache.org/docs/contribute/ for useful information and tips. Please request code reviews from Reviewers by @-ing them in a comment.

cc @Hzfengsy, @junrushao, @quic-sanirudh, @shingjan _{See #10317 for details}

_{Generated by tvm-bot}

Hzfengsy

LGTM. But I want to remind you that the int4 Tensor Core support is removed from the 4th Tensor Core (Rtx 40 serious and Hopper)

yzh119 · 2023-04-12T01:45:20Z

@Hzfengsy , int4 Tensor Cores is still supported in RTX 40 series, per Ada whitepaper.

yzh119

A slight issue, otherwise LGTM.

yzh119 · 2023-04-12T01:46:53Z

python/tvm/tir/tensor_intrin/cuda.py

@@ -817,6 +916,12 @@ def wmma_sync_impl(a: T.handle, b: T.handle, c: T.handle) -> None:
    *get_wmma_sync_intrin(16, 16, 16, "int8", "int32", True),
 )

+WMMA_SYNC_8x8x32_s4s4s32_TRANS_INTRIN = "wmma_sync_8x8x32_s4s4s32_trans"


"wmma_sync_8x8x32_s4s4s32" is missing.

sub-byte tensor core only allows A in row major and B in col major

Oh that's interesting! Maybe we can leave a note somewhere.

[TIR] Add CUDA int4 tensor core intrinsics

ba7b3c5

github-actions bot requested review from junrushao, masahi and tqchen April 11, 2023 23:36

Hzfengsy approved these changes Apr 12, 2023

View reviewed changes

yzh119 reviewed Apr 12, 2023

View reviewed changes

lint

bd0aa90

yzh119 approved these changes Apr 12, 2023

View reviewed changes

tqchen merged commit c1d1e9f into apache:main Apr 12, 2023

ysh329 mentioned this pull request Jul 12, 2023

[Release] v0.13.0 Release Candidate Notes #15295

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[TIR] Add CUDA int4 tensor core intrinsics #14598

[TIR] Add CUDA int4 tensor core intrinsics #14598

vinx13 commented Apr 11, 2023

tvm-bot commented Apr 11, 2023

Hzfengsy left a comment

yzh119 commented Apr 12, 2023

yzh119 left a comment

yzh119 Apr 12, 2023

vinx13 Apr 12, 2023

yzh119 Apr 12, 2023 •

edited

Loading

[TIR] Add CUDA int4 tensor core intrinsics #14598

[TIR] Add CUDA int4 tensor core intrinsics #14598

Conversation

vinx13 commented Apr 11, 2023

tvm-bot commented Apr 11, 2023

Hzfengsy left a comment

Choose a reason for hiding this comment

yzh119 commented Apr 12, 2023

yzh119 left a comment

Choose a reason for hiding this comment

yzh119 Apr 12, 2023

Choose a reason for hiding this comment

vinx13 Apr 12, 2023

Choose a reason for hiding this comment

yzh119 Apr 12, 2023 • edited Loading

Choose a reason for hiding this comment

yzh119 Apr 12, 2023 •

edited

Loading