Skip to content

Commit 37fd8a6

Browse files
authored
docs: Update Qwen3 32B AWQ streaming documentation (#453)
2 parents 61a0bcb + 6b3ae28 commit 37fd8a6

File tree

1 file changed

+0
-4
lines changed

1 file changed

+0
-4
lines changed

hub/public-endpoint-reference.mdx

Lines changed: 0 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -220,14 +220,10 @@ response = client.chat.completions.create(
220220
model=model_name,
221221
messages=messages,
222222
max_tokens=525,
223-
stream_options={"include_usage": True},
224223
stream=True
225224
)
226225
```
227226

228-
<Tip>
229-
`stream_options={"include_usage": True}` is required for streaming to work with vLLM Public Endpoints.
230-
</Tip>
231227
</Accordion>
232228

233229
<Accordion title="Response format">

0 commit comments

Comments
 (0)