Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Use __ldg on Scalar Read-only Data Stores #1795

Closed
wants to merge 2 commits into from

Conversation

fthaler
Copy link
Contributor

@fthaler fthaler commented Jul 24, 2024

Uses the __ldg instruction on loads from read-only data stores with supported scalar data types. This might improve performance due to more optimal caching and helps NVCC to elide redundant load instructions.

@fthaler
Copy link
Contributor Author

fthaler commented Jul 24, 2024

launch perftest

@gridtoolsjenkins
Copy link
Collaborator

Hi there, this is jenkins continuous integration...
Do you want me to verify this patch?

@fthaler
Copy link
Contributor Author

fthaler commented Jul 24, 2024

launch jenkins

@fthaler
Copy link
Contributor Author

fthaler commented Jul 24, 2024

launch perftest

@fthaler
Copy link
Contributor Author

fthaler commented Jul 24, 2024

launch jenkins

@@ -23,6 +23,10 @@
#include "../meta.hpp"
#include "data_store.hpp"

#if defined(__CUDA_ARCH__) && __CUDA_ARCH__ >= 350
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

see both comments in the other __ldg pr

@@ -32,11 +36,43 @@ namespace gridtools {
return lhs;
}
};
template <class T>
struct const_ptr_wrapper {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should that better live somewhere in the sid module and possibly become the default also for simple_ptr_holder?

@fthaler
Copy link
Contributor Author

fthaler commented Aug 14, 2024

Closed in favor of #1802

@fthaler fthaler closed this Aug 14, 2024
fthaler added a commit that referenced this pull request Sep 24, 2024
Additionally fixes missing `host_view` etc. in const data stores. Gives
speedups when combined with #1795.
havogt pushed a commit that referenced this pull request Sep 30, 2024
Additionally fixes missing `host_view` etc. in const data stores. Gives
speedups when combined with #1795.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants