Workarounds for CUDA 12.1 and 12.2 #1764

havogt · 2023-08-03T07:23:09Z

CUDA 12.1 and 12.2 have a problem with constexpr, e.g. in the context of CTAD, see #1766. The workaround is to do pre-C++17 make_tuple-construction or construct from a (possibly moved-from) lvalue.

CI: add GCC + CUDA 12.1/12.2 and NVHPC 23.7

…_237_compilation

havogt · 2023-08-10T05:04:40Z

launch jenkins

tests/include/nvcc_workarounds.hpp

tests/regression/fn/fn_cartesian_vertical_advection.cpp

havogt · 2023-08-10T05:28:23Z

launch jenkins

havogt · 2023-08-10T07:06:49Z

launch perftests

petiaccja

Some minor cleanups, otherwise not much to do in my opinion.

One question though, why not just use gridtools::make_tuple everywhere, why a make_1_tuple?

include/gridtools/fn/column_stage.hpp

tests/include/nvcc_workarounds.hpp

tests/unit_tests/common/test_tuple.cpp

havogt · 2023-08-15T06:44:23Z

Some minor cleanups, otherwise not much to do in my opinion.

One question though, why not just use gridtools::make_tuple everywhere, why a make_1_tuple?

I wanted to emphasize that this is a workaround that might be needed for users. If you see weird behavior in your code, you might compare against the example and find the problem.

havogt · 2023-08-15T06:52:54Z

launch jenkins

tests/unit_tests/common/test_tuple.cpp

petiaccja · 2023-08-15T08:10:56Z

Some minor cleanups, otherwise not much to do in my opinion.
One question though, why not just use gridtools::make_tuple everywhere, why a make_1_tuple?

I wanted to emphasize that this is a workaround that might be needed for users. If you see weird behavior in your code, you might compare against the example and find the problem.

I understand, however, is this workaround ever gonna go away? (Ever = within 1-2 years.) The problem is, unless they fix their older compiler releases or we simply stop supporting those specific releases, we have to keep this workaround in the code, and then I'd rather use a proper interface that works consistently even if it's not obvious why that's used.

Co-authored-by: Péter Kardos <kardospeter1994@hotmail.com>

havogt · 2023-08-16T09:29:52Z

launch jenkins

petiaccja

As discussed, we'll keep the workaround function make_1_tuple because it's only used in the tests, so it's not visible to users. Users should be discouraged to use affected CUDA versions unless nVidia fixes the problem.

CUDA 12.1 and 12.2 have a problem with constexpr, e.g. in the context of CTAD, see #1766. The workaround is to do pre-C++17 `make_tuple`-construction or construct from a (possibly moved-from) lvalue. CI: add GCC + CUDA 12.1/12.2 and NVHPC 23.7 Co-authored-by: Péter Kardos <kardospeter1994@hotmail.com>

havogt and others added 4 commits August 3, 2023 09:22

CI: Add NVHPC 23.7 to compilation test

2fe5571

CI: test CUDA 12.1 and 12.2

ee95989

Merge remote-tracking branch 'upstream/test_cuda12.2' into test_nvhpc…

ac9e05a

…_237_compilation

cuda 12.1/12.2 partial workarounds

52ec547

havogt requested a review from petiaccja August 10, 2023 05:05

havogt mentioned this pull request Aug 10, 2023

CI: test CUDA 12.1 and 12.2 #1765

Closed

havogt commented Aug 10, 2023

View reviewed changes

tests/include/nvcc_workarounds.hpp Outdated Show resolved Hide resolved

Update tests/include/nvcc_workarounds.hpp

7c97e9a

havogt commented Aug 10, 2023

View reviewed changes

tests/regression/fn/fn_cartesian_vertical_advection.cpp Show resolved Hide resolved

havogt changed the title ~~CI: Add NVHPC 23.7 to compilation test~~ Workarounds for CUDA 12.1 and 12.2 Aug 10, 2023

petiaccja suggested changes Aug 14, 2023

View reviewed changes

include/gridtools/fn/column_stage.hpp Outdated Show resolved Hide resolved

include/gridtools/fn/column_stage.hpp Outdated Show resolved Hide resolved

tests/include/nvcc_workarounds.hpp Show resolved Hide resolved

tests/unit_tests/common/test_tuple.cpp Outdated Show resolved Hide resolved

introduce workaround macro

7eb98b3

havogt requested a review from petiaccja August 15, 2023 07:17

petiaccja suggested changes Aug 15, 2023

View reviewed changes

tests/unit_tests/common/test_tuple.cpp Outdated Show resolved Hide resolved

Update tests/unit_tests/common/test_tuple.cpp

6dbab72

Co-authored-by: Péter Kardos <kardospeter1994@hotmail.com>

petiaccja approved these changes Aug 16, 2023

View reviewed changes

havogt merged commit 5e1011a into master Aug 16, 2023
56 checks passed

havogt deleted the test_nvhpc_237_compilation branch August 16, 2023 12:02

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Workarounds for CUDA 12.1 and 12.2 #1764

Workarounds for CUDA 12.1 and 12.2 #1764

havogt commented Aug 3, 2023 •

edited

Loading

havogt commented Aug 10, 2023

havogt commented Aug 10, 2023

havogt commented Aug 10, 2023

petiaccja left a comment

havogt commented Aug 15, 2023

havogt commented Aug 15, 2023

petiaccja commented Aug 15, 2023

havogt commented Aug 16, 2023

petiaccja left a comment

Workarounds for CUDA 12.1 and 12.2 #1764

Workarounds for CUDA 12.1 and 12.2 #1764

Conversation

havogt commented Aug 3, 2023 • edited Loading

havogt commented Aug 10, 2023

havogt commented Aug 10, 2023

havogt commented Aug 10, 2023

petiaccja left a comment

Choose a reason for hiding this comment

havogt commented Aug 15, 2023

havogt commented Aug 15, 2023

petiaccja commented Aug 15, 2023

havogt commented Aug 16, 2023

petiaccja left a comment

Choose a reason for hiding this comment

havogt commented Aug 3, 2023 •

edited

Loading