How to allocate memory from 2nd GPU? #156

Open · aeon3 opened this issue Sep 8, 2022 · 16 comments
Labels: enhancement (New feature or request)

Comments

@aeon3 commented Sep 8, 2022

Here is the error I ran into:

"RuntimeError: CUDA out of memory. Tried to allocate 18.00 GiB (GPU 0; 24.00 GiB total capacity; 20.51 GiB already allocated; 618.87 MiB free; 20.59 GiB reserved in total by PyTorch) If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF"

I have a 2nd GPU which could be used to allocate that extra 18 GB; however, I need help figuring out how to show SD there is a 2nd GPU present.

Any thoughts?
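
For context, PyTorch itself can already enumerate a second GPU; the hard part is that the webui places the model on a single device. A minimal check like the sketch below shows both cards are visible, but by itself it does not make SD use the second one:

```python
# Illustrative check only: PyTorch can see both GPUs, but a tensor or model lives
# only on the device it is explicitly moved to. This does not change what the webui does.
import torch

print(torch.cuda.device_count())               # e.g. 2 if both cards are visible
for i in range(torch.cuda.device_count()):
    print(i, torch.cuda.get_device_name(i))

x = torch.empty(1024, 1024, device="cuda:1")   # allocates on the second GPU only
```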

@AUTOMATIC1111 (Owner)

Using memory from between two GPUs is not simple. I only have one so I can't research/develop this.

@aeon3 (Author) commented Sep 8, 2022

> Using memory from between two GPUs is not simple. I only have one so I can't research/develop this.

Oh hi.
Well, I have mine linked with NVLink; I thought that would make it a breeze to benefit from memory pooling.
I guess it is not that different from having 2 unlinked GPUs after all?

@dev-greene commented Sep 9, 2022

I would be interested in this as well. I don't think something like SLI is the answer, though. Even just distributing the batch or iterations across the available GPUs would help.

@aeon3 (Author) commented Sep 9, 2022

Found this guy talking about it here:
https://youtu.be/hBKcL8fNZ18?list=PLzSRtos7-PQRCskmdrgtMYIt_bKEbMPfD&t=481

Not sure if it's helpful or not, but he shows some code.

@mchaker commented Sep 23, 2022

This is the most intuitive and complete webui fork. It would be amazing if this could be implemented here:

NickLucche/stable-diffusion-nvidia-docker#8

The potential to double image output even with the same VRAM is awesome.

from #311

@mchaker commented Sep 23, 2022

For more than just 2 GPUs, NickLucche has code:

I imagine you're really busy with all the requests and bugs, but if you have 5 minutes, have a look at this file in NickLucche's project:

https://github.com/NickLucche/stable-diffusion-nvidia-docker/blob/master/parallel.py

He apparently wrote an external wrapper around the application that queries whether multiple GPUs are present and, if so, uses data parallelism.

@dfaker added the enhancement (New feature or request) and dreams labels Sep 27, 2022
@NickLucche

Hi! I could probably port this multi-GPU feature, but I would appreciate some pointers as to where in the code I should look for the actual model (I am using the vanilla one from Hugging Face).
The easiest mode would be implementing a ~data parallel approach, in which we have one model per GPU and the workload is distributed among them.
Given the number of features this repo provides, I think it could take some time to have them all supported in the parallel version.
Let me know your thoughts on this.
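
For reference, a minimal sketch of such a data-parallel wrapper, assuming the vanilla Hugging Face diffusers pipeline rather than this repo's model loading code (the checkpoint ID is just an example):

```python
# Minimal data-parallel sketch: one pipeline copy per GPU, prompts split between them.
# Assumes the plain diffusers pipeline, not the webui's own model handling.
from concurrent.futures import ThreadPoolExecutor

import torch
from diffusers import StableDiffusionPipeline

MODEL_ID = "runwayml/stable-diffusion-v1-5"  # example checkpoint, swap for your own

def load_replicas():
    """Load one pipeline replica per visible CUDA device."""
    replicas = []
    for i in range(torch.cuda.device_count()):
        pipe = StableDiffusionPipeline.from_pretrained(MODEL_ID, torch_dtype=torch.float16)
        replicas.append(pipe.to(f"cuda:{i}"))
    return replicas

def generate(replicas, prompts):
    """Split prompts across replicas and run them concurrently, one thread per GPU."""
    buckets = [prompts[i::len(replicas)] for i in range(len(replicas))]
    with ThreadPoolExecutor(max_workers=len(replicas)) as pool:
        results = pool.map(lambda pair: pair[0](pair[1]).images, zip(replicas, buckets))
    return [img for batch in results for img in batch]
```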

@swcrazyfan

> Hi! I could probably port this multi-GPU feature, but I would appreciate some pointers as to where in the code I should look for the actual model (I am using the vanilla one from Hugging Face).
>
> The easiest mode would be implementing a ~data parallel approach, in which we have one model per GPU and the workload is distributed among them.
>
> Given the number of features this repo provides, I think it could take some time to have them all supported in the parallel version.
>
> Let me know your thoughts on this.

Is this still in the works? I understand it could take a while to make everything support multiple GPUs, but if I could use both of my GPUs to generate images, that would be good enough. For example, if I select a batch of 2, each GPU would do one; if I did 8, each would do 4.

Is that complicated?

@Extraltodeus (Contributor)

@swcrazyfan you can already load two instances at the same time.
#3377

Just use --device-id 0 in one and --device-id 1 in the other.
Also --port some_port_number with a different port for each instance.

Of course it is not an optimal solution and you might need more RAM to run both instances. --lowram might help too.
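
As a rough illustration, a small launcher script could start one instance per GPU, assuming webui.py is invoked directly and the flags are passed through unchanged (the path and port numbers here are just examples):

```python
# Sketch: launch one webui instance per GPU, each pinned to its own device and port.
# The "webui.py" path is an assumption; point it at your actual checkout.
import subprocess

instances = []
for gpu_id, port in [(0, 7860), (1, 7861)]:
    instances.append(subprocess.Popen([
        "python", "webui.py",
        "--device-id", str(gpu_id),
        "--port", str(port),
        "--lowram",  # optional, if system RAM is tight with two instances
    ]))

for proc in instances:
    proc.wait()
```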

@precompute

Is this being worked on? It sounds like an awesome feature. Even if it's restricted to txt2img, it'd be a start.

I guess this would require major changes to the way images are handled right now; there would probably need to be a queue of sorts to make this work.
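
A queue of that sort could look roughly like the sketch below, where generate_on_gpu is a hypothetical stand-in for the real txt2img call:

```python
# Illustrative job queue: one worker thread per GPU pulls prompts off a shared queue.
import queue
import threading

import torch

def generate_on_gpu(prompt, device):
    # Hypothetical placeholder for the actual txt2img call on a specific device.
    print(f"would render {prompt!r} on {device}")

def worker(jobs, device):
    while True:
        prompt = jobs.get()
        if prompt is None:  # sentinel: shut this worker down
            break
        generate_on_gpu(prompt, device)
        jobs.task_done()

jobs = queue.Queue()
devices = [f"cuda:{i}" for i in range(torch.cuda.device_count())]
threads = [threading.Thread(target=worker, args=(jobs, d), daemon=True) for d in devices]
for t in threads:
    t.start()
```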

@Lukium commented Nov 7, 2022

> Hi! I could probably port this multi-GPU feature, but I would appreciate some pointers as to where in the code I should look for the actual model (I am using the vanilla one from Hugging Face). The easiest mode would be implementing a ~data parallel approach, in which we have one model per GPU and the workload is distributed among them. Given the number of features this repo provides, I think it could take some time to have them all supported in the parallel version. Let me know your thoughts on this.

I'd be happy to help test this if it's something that's being worked on. I'm currently running an 11x RTX 3090 server for a Discord community using @Extraltodeus's --device-id feature #3377, and I think that having some parallelism would further benefit the community greatly. I'm not sure if it's OK to mention community links here, but the info is in my profile, and you're welcome to DM me on Discord if it's something you would like help testing.

@Omegadarling

Just popping in to check on this. I also have an 8x 3090 machine and a 2x 3090 machine (both have 256 GB RAM) that would be great for testing parallelization.

@mezotaken removed the dreams label Jan 16, 2023
@zeigerpuppy

This would be a really great feature. Just being able to distribute a batch across GPUs would be a good start.

Having a round-robin for the "next GPU" would also be useful to distribute web requests across a pool of GPUs.
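
Such a round-robin dispatcher could be as simple as the following sketch, assuming one webui instance per GPU started with --api on different ports (the URLs are just examples):

```python
# Round-robin sketch: cycle incoming requests across a pool of per-GPU webui instances,
# each started with --api, --device-id and its own --port.
import itertools

import requests

BACKENDS = itertools.cycle([
    "http://127.0.0.1:7860",  # instance pinned to GPU 0
    "http://127.0.0.1:7861",  # instance pinned to GPU 1
])

def submit(payload):
    """Send a txt2img request to the next backend in the pool."""
    backend = next(BACKENDS)
    return requests.post(f"{backend}/sdapi/v1/txt2img", json=payload, timeout=600)
```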

@zeigerpuppy

P.S. I think this issue has drifted a bit from a memory question to a multi-GPU support question in general. It may be good to change the title to something like "Multi-GPU support for parallel queries". I think that is somewhat distinct from the original question about memory pooling (which is a much more difficult ask!).

@hananbeer (Contributor)

> Using memory from between two GPUs is not simple. I only have one so I can't research/develop this.

Well, let's get it funded then.

@moxSedai

I'm not sure this is really a parallel query question though, is it? I found it while looking for using multiple GPUs for a single query, and most of the discussion was based on that.

nne998 pushed a commit to fjteam/stable-diffusion-webui that referenced this issue Sep 26, 2023
Atry pushed a commit to Atry/stable-diffusion-webui that referenced this issue Jul 9, 2024: …ify AUTOMATIC1111#99 now returns the cheap_approx rather than grey image