Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Attempt to fix nested vectorization gemm performance on new build bot #7959

Merged
merged 5 commits into from
Nov 29, 2023

Conversation

abadams
Copy link
Member

@abadams abadams commented Nov 22, 2023

This new schedule is about the same speed as the old one on an M1, but generates much less code, spills less, and avoids the use of tbl instructions in favor of zips.

@abadams
Copy link
Member Author

abadams commented Nov 28, 2023

Looks like this did indeed fix the performance regression on the new buildbot for that specific test:

https://buildbot.halide-lang.org/master/#/builders/81/builds/178

However it seems other things are broken now

@steven-johnson
Copy link
Contributor

ugh, looks like my Mac maintenance yesterday broke Python on max-x86-3, that should be easy to fix, but the armbots just pull the source and then fail with no explanation? wtf?

@steven-johnson
Copy link
Contributor

looks like the armbot just needed a reboot, working on the other one now

@steven-johnson
Copy link
Contributor

Is this ready to land?

@abadams
Copy link
Member Author

abadams commented Nov 29, 2023

Yes, failures appear to be unrelated

@steven-johnson steven-johnson self-requested a review November 29, 2023 16:39
@steven-johnson steven-johnson merged commit b7468af into main Nov 29, 2023
3 checks passed
@steven-johnson steven-johnson deleted the abadams/fix_nested_vectorization_gemm_perf branch November 29, 2023 16:39
ardier pushed a commit to ardier/Halide-mutation that referenced this pull request Mar 3, 2024
…halide#7959)

* Better (simpler) schedules for nested vectorization gemm

* Remove early return

* Empty-Commit

---------

Co-authored-by: Steven Johnson <srj@google.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants