Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Enhanced Loop Unrolling #766

Merged
merged 5 commits into from
Mar 29, 2022
Merged

Enhanced Loop Unrolling #766

merged 5 commits into from
Mar 29, 2022

Conversation

m4rs-mt
Copy link
Owner

@m4rs-mt m4rs-mt commented Mar 3, 2022

This PR enhances the internal Loop Unrolling pass to be compatible with additional loops (involving mul, shr, div and shl operations). This allows ILGPU to unroll common Warp.Reduce patterns, which in turn reduces register consumption and improves runtime performance.

@m4rs-mt m4rs-mt added this to the v1.2 milestone Mar 3, 2022
@m4rs-mt m4rs-mt marked this pull request as ready for review March 3, 2022 19:04
@m4rs-mt m4rs-mt merged commit 107d3e0 into master Mar 29, 2022
@m4rs-mt m4rs-mt deleted the loop_unrolling branch March 29, 2022 10:45
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants