-
-
Notifications
You must be signed in to change notification settings - Fork 11.6k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Add KV Cache Memory Estimator Example Script
documentation
Improvements or additions to documentation
#29736
opened Nov 29, 2025 by
ksenthilnathan02
Loading…
[Feature][#29390]: Add timeout support to MultiprocExecutor.collective_rpc and FutureWrapper
v1
#29733
opened Nov 29, 2025 by
SandishKumarHN
Loading…
5 tasks
[Quantization] Enable compressed-tensors AWQ for Turing GPU
ready
ONLY add when PR is ready to merge/full CI is needed
#29732
opened Nov 29, 2025 by
Isotr0py
Loading…
1 of 5 tasks
[Misc] Update Improvements or additions to documentation
frontend
ready
ONLY add when PR is ready to merge/full CI is needed
v1
TokenizerLike interface and move get_cached_tokenizer
ci/build
documentation
#29730
opened Nov 29, 2025 by
DarkLight1337
Loading…
5 tasks
[Frontend] Add streaming tool-call support to Responses API (non-Harmony)
frontend
gpt-oss
Related to GPT-OSS models
#29726
opened Nov 29, 2025 by
sumitaryal
Loading…
5 tasks
[V1][Spec Decode] Optimize Medusa proposer to avoid GPU-CPU sync
speculative-decoding
v1
#29723
opened Nov 29, 2025 by
dongbo910220
Loading…
5 tasks
[WIP][Feat][Sched] Support Balance Scheduling
v1
#29721
opened Nov 29, 2025 by
GDzhu01
Loading…
5 tasks
[Bugfix] Suppress non-TTY color output on the process name part of the log
#29714
opened Nov 29, 2025 by
a4lg
Loading…
1 of 5 tasks
SM120 / NVFP4: add device guard and runtime SM dispatch to cutlass_scaled_fp4_mm
nvidia
#29711
opened Nov 29, 2025 by
hholtmann
Loading…
[perf] Use direct copy (broadcast) instead of cat for k_nope/k_pe in MLA prefill
v1
#29710
opened Nov 29, 2025 by
minosfuture
Loading…
5 tasks
[KVConnector] remove unused code (the model aware kv ops class)
kv-connector
#29709
opened Nov 29, 2025 by
KuntaiDu
Loading…
5 tasks
[KVConnector] Remove v0-related kv connector components such as kv pipe and kv lookup buffer
kv-connector
#29705
opened Nov 28, 2025 by
KuntaiDu
Loading…
5 tasks
FlashInfer-Bench Integration for vLLM
documentation
Improvements or additions to documentation
nvidia
v1
#29695
opened Nov 28, 2025 by
sfc-gh-goliaro
•
Draft
4 of 11 tasks
[Bugfix] Schedule failure due to wrong get_image_size_with_most_features
multi-modality
Related to multi-modality (#4194)
qwen
Related to Qwen models
ready
ONLY add when PR is ready to merge/full CI is needed
#29692
opened Nov 28, 2025 by
tomtomjhj
Loading…
3 of 5 tasks
[WIP][Kernel]Support W4A8 Grouped GEMM on Hopper
ci/build
new-model
Requests to new models
nvidia
#29691
opened Nov 28, 2025 by
czhu-cohere
Loading…
5 tasks
[CI] Renovation of nightly wheel build & generation
ci/build
#29690
opened Nov 28, 2025 by
Harry-Chen
Loading…
3 of 5 tasks
[Chore]: Remove Olmo3 and FlexOlmo config copy
ready
ONLY add when PR is ready to merge/full CI is needed
#29677
opened Nov 28, 2025 by
Isotr0py
Loading…
1 of 5 tasks
[CI/build] Add libraries needed for building VLLM wheel to the test docker image.
ci/build
#29672
opened Nov 28, 2025 by
halyavin
Loading…
5 tasks
[NIXL] Add remote_request_id to kv_transfer_params
kv-connector
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#29665
opened Nov 28, 2025 by
markmc
Loading…
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.