
This library should be usable directly by a host compiler to make compatible host objects #940

Open
ogiroux opened this issue Sep 21, 2020 · 7 comments
Labels: libcu++

@ogiroux
Contributor

ogiroux commented Sep 21, 2020

In principle this should work; it's intended to work. But I think a lack of testing has allowed this aspect to regress.

@brycelelbach
Collaborator

This adds a lot of testing overhead. What's the value-add for this?

@brycelelbach
Collaborator

I'm not 100% clear what the ask is here. You want cuda::std:: to work standalone under a host compiler? Is that right?

@nvibd

nvibd commented Mar 3, 2021

Unfortunately, it doesn't work in all cases; we found one such issue recently, see #968.

In our project we use quite a few templates for flexibility between host and device code, which also makes it harder to separate all device-only code cleanly from the host compilation. In one case we include <cuda/std/atomic> for some device code, and that include propagates into the host compilation. We could wrap the include in an #ifdef __CUDA_ARCH__, but that would defeat one of the purposes of using libcu++ in the first place: requiring as few such switches as possible.
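To illustrate (the file and type names here are hypothetical, not from our code base), a shared header along these lines ends up being compiled by the plain host compiler:

// widget.h -- hypothetical shared header, included from both .cu and .cpp TUs
#pragma once
#include <cuda/std/atomic>   // needed for the device code, but propagates everywhere

template <typename T>
struct Widget {
    cuda::std::atomic<int> refcount{0};   // heterogeneous atomic, meant to work on host and device
    T payload;
};

// widget_host.cpp -- compiled by g++ alone; <cuda/std/atomic> must be usable host-side here
#include "widget.h"

int host_refcount(Widget<float>& w) {
    return w.refcount.load();
}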

As the front page of the project says: "It provides a heterogeneous implementation of the C++ Standard Library that can be used in and between CPU and GPU code". That really sounds like a great advantage to me, and I would hope to see the compatibility improved further! :)

@gonzalobg
Collaborator

gonzalobg commented Mar 23, 2021

This fails (compile with bash bug.cpp):

#if 0
# This file is its own build script: running `bash bug.cpp` executes the shell
# lines in this block (which the C++ preprocessor skips) to build and run it.
  set -e
  g++ -std=c++14 $0 -o bug  # assumes the libcu++ headers are on the include path
  ./bug
  exit 0
#endif
#include <cuda/std/complex>
int main() {
    auto x = cuda::std::complex<double>{1., 1.};
    auto y = x + x;    // host-only compilation fails on this operator+
    (void)y;           // silence unused-variable warnings
    return 0;
}

This is a minimal reproducer of a bug we hit during the Juelich hackathon over the past two weeks while porting a C++14 solid-state physics app to GPUs.

The app has a C++14 template-library dependency (Blaze, similar to Eigen3) that causes NVCC and NVC++ to ICE, so we had to compile most of the app with g++, scoping GPU acceleration to separate TUs.

The app uses std::complex everywhere in its module APIs, but since its layout differs from that of cuda::std::complex and cuDoubleComplex, we can't pass raw memory between the parts of the app compiled with g++ or clang and the parts compiled with nvcc/nvc++.

Workaround: add free-function overloads for cuDoubleComplex that mock the std::complex API without changing its ABI.
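For concreteness, a minimal sketch of that shim, using only the helpers cuComplex.h actually provides (the header name is made up):

// complex_compat.h -- hypothetical shim; cuDoubleComplex keeps its double2
// layout, so raw memory stays interchangeable across the g++/nvcc boundary.
#include <cuComplex.h>

inline double real(cuDoubleComplex z) { return cuCreal(z); }
inline double imag(cuDoubleComplex z) { return cuCimag(z); }

inline cuDoubleComplex operator+(cuDoubleComplex a, cuDoubleComplex b) {
    return cuCadd(a, b);
}
inline cuDoubleComplex operator*(cuDoubleComplex a, cuDoubleComplex b) {
    return cuCmul(a, b);
}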

@brycelelbach

What's the value-add for this?

libcu++: The C++ Standard Library for Your Entire System

The value this feature adds is allowing libcu++ to interface with the system.

If libcu++ cannot be compiled by the most widely used compilers on the system (g++ and clang++ on Linux), then it cannot be used in APIs/ABIs that must interface with the system, and its usage must therefore be scoped to the implementation details of translation units that do not interface with the system.

I think that is a serious limitation.

The bigger libcu++ gets, the more work it will take to fix this.

@brycelelbach
Collaborator

We'll try to prioritize this for the summer. 2.1.0 timeframe.

@maddyscientist

Adding to the motivation: this turns out to be a big deal for QUDA as well, and it prevents QUDA from adopting cuda::std::complex, both for files that use only the host compiler (e.g., .cpp files) and when we use g++ with nvrtc rather than g++ with nvcc.

@jrhemstad
Collaborator

We want to do this, and we'll just need to figure out how to modify the lit infrastructure to compile tests with just a host compiler.

@github-project-automation github-project-automation bot moved this to Todo in CCCL Nov 8, 2023
@jarmak-nv jarmak-nv transferred this issue from NVIDIA/libcudacxx Nov 8, 2023