
Set GPU ThreadPoolExecutor and set known libraries to use it #5084

Open · mrocklin opened this issue Jul 17, 2021 · 11 comments

@mrocklin

With @madsbk's recent work allowing for multiple executors, we might consider creating a GPU ThreadPoolExecutor in workers by default if a GPU is detected, and then annotating tasks known to be GPU-targeted with that executor. This would improve the likelihood that a user of vanilla dask has a good time with RAPIDS, cupy, or other known projects.

We probably can't do this in full generality (it's hard to detect which code uses GPUs), but we're no worse off if we don't catch something, and we can handle the common cases well.

Concretely, I propose:

  1. Having the Worker class try importing pynvml (or some future NVIDIA Python library) and, if it detects a GPU, creating a single-threaded ThreadPoolExecutor:

    try:
        import pynvml
        pynvml.nvmlInit()  # raises if no NVIDIA driver is present
    except Exception:
        pass
    else:
        if pynvml.nvmlDeviceGetCount() > 0:
            self.executors["gpu"] = ThreadPoolExecutor(1)
  2. In known GPU libraries, adding an annotation to every layer:

    class ArrayLayer:
        def __init__(self, ...):
            if "cupy" in str(type(self._meta)):
                self.annotations["executor"] = "gpu"  # or use setdefault or something
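As an aside, string-matching on `str(type(...))` works but is brittle; a slightly more robust check (a hypothetical helper, not RAPIDS code) could inspect the meta object's defining module without importing cupy at all:

```python
import numpy as np

def is_cupy_backed(meta):
    # Hypothetical helper: look at the top-level module that defines the
    # meta object's type; this avoids importing cupy just to test for it.
    return type(meta).__module__.split(".")[0] == "cupy"

print(is_cupy_backed(np.empty(0)))  # False for a NumPy-backed meta
```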

cc @madsbk @quasiben @kkraus14

dask-cuda handles this for users who use it, so this feels like something that we could upstream. It would also help with mixed CPU-GPU computation.
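The routing idea behind the two steps above can be sketched independently of distributed's internals: a worker keeps a dict of named executors and picks one based on a task's annotations. Everything below (the `ToyWorker` class and its `submit` signature) is illustrative, not distributed's actual API:

```python
from concurrent.futures import ThreadPoolExecutor

class ToyWorker:
    """Toy model of a worker that routes tasks to named executors."""

    def __init__(self):
        self.executors = {"default": ThreadPoolExecutor(4)}
        # If a GPU were detected, register a single-threaded executor for it
        self.executors["gpu"] = ThreadPoolExecutor(1)

    def submit(self, fn, *args, annotations=None):
        # Tasks annotated with executor="gpu" land on the GPU pool;
        # everything else falls back to the default pool.
        name = (annotations or {}).get("executor", "default")
        return self.executors[name].submit(fn, *args)

worker = ToyWorker()
fut = worker.submit(lambda x: x + 1, 41, annotations={"executor": "gpu"})
print(fut.result())  # 42
```

The key design point is that the annotation only selects *which* pool runs the task; unannotated work is unaffected, which is exactly the opt-in behavior discussed below.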

@mrocklin

cc @rjzamora @pentschev

@mrocklin

So RAPIDS folks, would it be in-scope for RAPIDS to add annotations={"executor": "gpu"} to all layers?

mrocklin added a commit to mrocklin/distributed that referenced this issue Jul 26, 2021
@mrocklin

Adding this to the Worker here: #5123

@quasiben

@pentschev what do you think about this? We've seen some cases recently where users run into issues where a GPU is "detected" and either they don't want to use it or, in rare cases, the GPU is not actually there. The default, however, can be nice for the vast majority of users who do want to use GPUs with Dask.

@mrocklin, we have run into many issues around getting the order of CUDA context creation correct, and around forking processes and creating CUDA contexts at the right time when using UCX (though I think the UCX folks are working on resolving this bug).

@mrocklin

To be clear, this doesn't make users use this executor. It just adds one for people to use if they want it. It doesn't change the default behavior of Dask. It just allows people to use annotations like the following:

with dask.annotate(executor="gpu"):
    ...  # my dask code

And then they'll be assured that their code will run on a single thread.
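The mechanics of such an annotation context manager can be mimicked in a few lines of plain Python. This is an illustrative sketch of the *pattern* (a context variable holding the current annotations, picked up by any "layer" built inside the block), not dask's actual implementation; `annotate` and `make_layer` here are stand-ins:

```python
import contextlib
import contextvars

# Context variable holding the annotations currently in effect.
_annotations = contextvars.ContextVar("annotations", default={})

@contextlib.contextmanager
def annotate(**kwargs):
    # Merge new annotations over any already in effect, restore on exit.
    merged = {**_annotations.get(), **kwargs}
    token = _annotations.set(merged)
    try:
        yield
    finally:
        _annotations.reset(token)

def make_layer():
    # A stand-in for graph construction: a real layer would capture the
    # active annotations at build time in the same way.
    return {"annotations": dict(_annotations.get())}

with annotate(executor="gpu"):
    layer = make_layer()

print(layer["annotations"])  # {'executor': 'gpu'}
```

Because the annotation is captured at graph-construction time, the scheduler can later route those tasks to the matching executor without the user's task functions changing at all.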

@pentschev

We've seen some cases recently where users run into issues where a GPU is "detected" and either they don't want to use it or, in rare cases, the GPU is not actually there. The default, however, can be nice for the vast majority of users who do want to use GPUs with Dask.

Ben has a good point. In fact #5121 is fixing yet another of those. In that case the user is running Distributed with nvidia-docker but without a GPU, causing another uncaught exception to be raised.

I think the overall idea is useful, and as long as this doesn't force a new default on users who happen to have GPU(s) installed, then I see no reason not to do that.

@mrocklin

I'm curious, would you all consider adding an executor="gpu" annotation to RAPIDS layers? If not, what situations would cause you to be concerned?

@mrocklin

I would like to try to upstream some of the more general parts of dask-cuda.

@jakirkham

I would like to try to upstream some of the more general parts of dask-cuda.

This sounds interesting. What other things would you be open to upstreaming? Some thoughts on things that might be of interest upstream:

  • Explicit comms
  • JIT spilling
  • Transmission of spilled objects
  • CUDA-specific spilling
  • ?

Historically one of the issues here has been the lack of GPU support on CI. If we are able to tackle that (dask/community#138), maybe this becomes more reasonable?

@mrocklin

I think that there are good ideas behind all of those things, and like the PR here does for threads and executors, we would need to find nice generic ways to incorporate them.

@rjzamora

would you all consider adding an executor="gpu" annotation to RAPIDS layers?

Peter and Ben probably have the best idea of what the appropriate defaults should be, but this particular request seems reasonable to me.
