I'm uncertain whether it's feasible to bypass Ray when serving on a single machine with multiple GPUs. Ray introduces additional maintenance costs in this use case.
At a minimum, Ray is needed if you want to use tensor parallelism across multiple GPUs, since each Worker instance must live in its own process rather than a thread. However, we could replace Ray with multiprocessing for that purpose.
I haven't seen any other reason why we need Ray in the code; there may be something else, for example memory management, object sharing, or similar.
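If it helps, here is a minimal sketch of the multiprocessing approach: one OS process per GPU rank, with `torch.distributed` handling tensor-parallel communication. This is not vLLM's actual API; the worker function and rendezvous address are hypothetical placeholders.

```python
# Sketch: replace Ray with one OS process per GPU for tensor parallelism.
# Hypothetical worker function, not vLLM's real Worker class.
import torch
import torch.multiprocessing as mp


def worker_main(rank: int, world_size: int) -> None:
    # Each worker owns exactly one GPU. NCCL expects one process per device,
    # which is why a separate process (not a thread) is needed per rank.
    torch.cuda.set_device(rank)
    torch.distributed.init_process_group(
        backend="nccl",
        init_method="tcp://127.0.0.1:29500",  # placeholder rendezvous address
        world_size=world_size,
        rank=rank,
    )
    # ... load this rank's model shard and serve requests here ...
    torch.distributed.destroy_process_group()


if __name__ == "__main__":
    world_size = torch.cuda.device_count()
    # "spawn" avoids forking a parent process that has already touched CUDA.
    mp.spawn(worker_main, args=(world_size,), nprocs=world_size, join=True)
```

This covers the process-per-GPU requirement; whatever Ray additionally provides (object store, scheduling, fault tolerance) would still need to be accounted for.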