Do you support streaming generating outputs? #245
-
Replies: 6 comments
-
Yes, our FastAPI and OpenAI servers support streaming outputs. Just set up the server with python -m vllm.entrypoints.api_server or python -m vllm.entrypoints.openai.api_server, and then add "stream": True to the client request (by default it is False). See vllm/examples/api_client.py, line 26 in 665c489.
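For illustration, here is a minimal sketch of such a streaming client (not the exact contents of api_client.py). It assumes the demo FastAPI server started with python -m vllm.entrypoints.api_server is listening on localhost:8000, exposes a /generate endpoint, and streams JSON chunks separated by null bytes; check your vLLM version for the actual endpoint and framing.

# Minimal streaming-client sketch for the demo FastAPI server.
# Assumptions (verify for your version): server on localhost:8000,
# a /generate endpoint, and JSON chunks separated by null bytes.
import json
import requests

payload = {
    "prompt": "San Francisco is a",
    "max_tokens": 64,
    "stream": True,  # the flag discussed above; defaults to False
}
response = requests.post("http://localhost:8000/generate", json=payload, stream=True)

for chunk in response.iter_lines(chunk_size=8192, decode_unicode=False, delimiter=b"\0"):
    if chunk:
        data = json.loads(chunk.decode("utf-8"))
        # Each chunk carries the text generated so far.
        print(data["text"])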
-
In addition, you can see the streaming result from the API server via python vllm/examples/api_client.py --stream.
-
@WoosukKwon python vllm/examples/api_client.py --stream works. But does it support the same streaming API as OpenAI? For example:

curl http://localhost:8000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "facebook/opt-125m",
    "messages": [
      {"role": "system", "content": "You are a helpful assistant."},
      {"role": "user", "content": "Who won the world series in 2020?"}
    ],
    "stream": true
  }'
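For what it is worth, here is a rough sketch of consuming the same stream with the openai Python package instead of curl (0.x-style client assumed; the base URL and model name are taken from the curl example above, and the exact streamed payload may differ by server version):

# Sketch: consuming the OpenAI-compatible streaming endpoint with the
# openai Python package (0.x-style API assumed).
import openai

openai.api_key = "EMPTY"  # placeholder; a local server typically ignores it
openai.api_base = "http://localhost:8000/v1"

stream = openai.ChatCompletion.create(
    model="facebook/opt-125m",
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Who won the world series in 2020?"},
    ],
    stream=True,  # same flag as in the curl request
)

for chunk in stream:
    # Each streamed chunk carries a delta with newly generated text.
    delta = chunk["choices"][0]["delta"]
    print(delta.get("content", ""), end="", flush=True)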
-
Does streaming output only support returning an async generator?
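For context, the question refers to the engine-level interface rather than the HTTP servers. Below is a rough sketch of how the async generator returned by vLLM's AsyncLLMEngine might be consumed (class and method names taken from the vLLM codebase; exact signatures can vary between versions):

# Sketch: consuming vLLM's engine-level streaming interface, which yields
# results from an async generator. Names and signatures may vary by version.
import asyncio
from vllm import AsyncEngineArgs, AsyncLLMEngine, SamplingParams

async def main():
    engine = AsyncLLMEngine.from_engine_args(
        AsyncEngineArgs(model="facebook/opt-125m"))
    params = SamplingParams(max_tokens=64)
    # generate() is an async generator: it yields RequestOutput objects
    # as new tokens are produced.
    async for output in engine.generate("Hello, my name is", params, request_id="0"):
        print(output.outputs[0].text)

asyncio.run(main())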
-
Your documentation feels quite hard to understand; after reading it for a while I still had to go look at the source code 😂 Could you consider adding a few more demos? Thanks.
-
How can I get the usage metrics (number of input tokens and output tokens) in streaming mode? @WoosukKwon