[GPU] Use shared memory for data tiled multi_mma ops #18625

Merged: 1 commit, Oct 1, 2024

@Max191 Max191 commented Sep 27, 2024

This enables shared memory usage for data tiled multi_mma ops. The only change is that promote_operands is now set on data tiled multi_mma configs.
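
For readers unfamiliar with the option, here is a rough sketch of what a lowering config with promote_operands set might look like in MLIR. The `promote_operands` key is the one named above. The operand indices and the `reduction` and `workgroup` tile sizes are illustrative assumptions, not values copied from this PR.

```mlir
// Illustrative sketch only: tile sizes and operand indices are placeholders
// and are not taken from this PR's diff.
#config = #iree_gpu.lowering_config<{
  // Assumed: promote the LHS (operand 0) and RHS (operand 1) of the
  // multi_mma op so they are staged through shared memory.
  promote_operands = [0, 1],
  reduction = [0, 0, 1],
  workgroup = [1, 1, 0]
}>
```

Without the `promote_operands` entry, the same config would be expected to skip the shared memory copies, which matches the note later in this thread that the e2e tests also pass without shared memory.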

@qedawkins qedawkins left a comment

Sorry, I forgot that I hadn't landed this yet: #18576

(That change was made specifically for this reason, so you can rebase on top of it now).

Max191 commented Sep 27, 2024

> Sorry, I forgot that I hadn't landed this yet: #18576
>
> (That change was made specifically for this reason, so you can rebase on top of it now).

Nice, I'll wait on this until my other kernel config PR lands then. The e2e tests work without shared memory too, so it's just an optimization right now.

@Max191 Max191 marked this pull request as draft September 27, 2024 19:40
@Max191 Max191 force-pushed the data-tiled-mma-use-shared-mem branch from 046d3c4 to 63ba6c2 Compare September 30, 2024 15:03

Max191 commented Sep 30, 2024

This is based on #18623 now. Changes for this PR are in the last 2 commits.

EDIT: rebased now.

@Max191 Max191 marked this pull request as ready for review September 30, 2024 15:05
Signed-off-by: Max Dawkins <max.dawkins@gmail.com>
@Max191 Max191 force-pushed the data-tiled-mma-use-shared-mem branch from 63ba6c2 to 25d2c05 Compare October 1, 2024 17:12
@Max191 Max191 enabled auto-merge (squash) October 1, 2024 17:19
@Max191 Max191 merged commit 839f7f6 into iree-org:main Oct 1, 2024
35 checks passed
Max191 added a commit that referenced this pull request Oct 7, 2024
This adds some matmul e2e tests for GPU data tiling on `gfx94*` chips.

This depends on:
#18626
#18623
#18624
#18625

Signed-off-by: Max Dawkins <max.dawkins@gmail.com>