Do you support streaming generating outputs? #245

Answered by WoosukKwon
ltz0120 asked this question in Q&A

Yes, both our FastAPI server and our OpenAI-compatible server support streaming outputs. Just start the server with

python -m vllm.entrypoints.api_server

or

python -m vllm.entrypoints.openai.api_server

and then set "stream": true in the client request (it defaults to false).
See the corresponding field in the server code:

"stream": stream,

Answer selected by zhuohan123
This discussion was converted from issue #230 on June 25, 2023 17:47.