-
Notifications
You must be signed in to change notification settings - Fork 55
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
TMem check the stride of outer dims #4070
base: main
Are you sure you want to change the base?
Conversation
Review updated until commit 3b44b2f Description
Changes walkthrough 📝
PR Reviewer Guide 🔍Here are some key observations to aid the review process:
|
!test |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I decided to not put the new tests into the tutorial. They are too complicated to be a good material for education. I want the reader of the tutorial to focus on the most important and basic concepts.
Validating that the TMem ld/st is scheduled as a warp-collective is fundamentally the same problem as validating the vectorization of a slice op with vectorization factor 32. This validation requires us to check the inner dim size, as well as outer dim's strides. Currently, we are only checking the inner-dim's size. This PR adds the missing stride check.
See discussion: #4015 (comment)