
bf16xint16_gemm operator: add --transpose option #2466

Closed

Conversation

davidberard98
Contributor

`--transpose` will make this benchmark test an int16 x bf16 mm instead of a bf16 x int16 mm.

This matters for H100 because the wgmma instruction can take register operands only on the LHS; since the int16 operand has to be converted to bf16 in registers before the mm, int16 x bf16 is probably the easier one to support efficiently.
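
For context, here is a minimal PyTorch-level sketch of what the two benchmark modes compute; this is not the actual TritonBench operator code, and the function names and shapes are illustrative assumptions. The point is which operand carries the int16 -> bf16 conversion:

```python
import torch

# A minimal sketch, NOT the actual operator code: function names and
# shapes here are illustrative assumptions about the two benchmark modes.

def bf16xint16_mm(a_bf16: torch.Tensor, b_int16: torch.Tensor) -> torch.Tensor:
    # Default mode: bf16 (LHS) x int16 (RHS). The int16 operand is upcast
    # to bf16 before the matmul.
    return torch.mm(a_bf16, b_int16.to(torch.bfloat16))

def int16xbf16_mm(a_int16: torch.Tensor, b_bf16: torch.Tensor) -> torch.Tensor:
    # --transpose mode: int16 (LHS) x bf16 (RHS). On H100, wgmma can read
    # its LHS operand from registers, so a fused Triton kernel can do the
    # int16 -> bf16 conversion in registers right before the mma.
    return torch.mm(a_int16.to(torch.bfloat16), b_bf16)

if __name__ == "__main__":
    device = "cuda" if torch.cuda.is_available() else "cpu"
    m, k, n = 256, 512, 128
    a = torch.randn(m, k, dtype=torch.bfloat16, device=device)
    b = torch.randint(-4, 4, (k, n), dtype=torch.int16, device=device)
    # The two modes compute transposes of each other: (A @ B).T == B.T @ A.T
    out = bf16xint16_mm(a, b)
    out_t = int16xbf16_mm(b.T.contiguous(), a.T.contiguous())
    torch.testing.assert_close(out, out_t.T)
```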

@facebook-github-bot
Contributor

@davidberard98 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

Summary:
`--transpose` will make this benchmark test an int16 x bf16 mm instead of a bf16 x int16 mm.

This matters for H100 because the wgmma instruction can take register operands only on the LHS; since the int16 operand has to be converted to bf16 in registers before the mm, int16 x bf16 is probably the easier one to support efficiently.

Pull Request resolved: pytorch#2466

Test Plan:
In OSS: ran `python run_benchmark.py triton --op bf16xint16_gemm --transpose`

Internally, ran `buck2 run mode/opt //pytorch/benchmark:triton -- --op bf16xint16_gemm --transpose`

Internally, we run into the issue fixed by triton-lang/triton#4695, but otherwise both commands run.

Differential Revision: D63294109

Pulled By: davidberard98
@facebook-github-bot
Contributor

This pull request was exported from Phabricator. Differential Revision: D63294109

@facebook-github-bot
Contributor

@davidberard98 merged this pull request in 0ab0e47.
