Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Add OpenVLA model support
#29738 opened Nov 30, 2025 by yongming-qin Draft
Add KV Cache Memory Estimator Example Script documentation Improvements or additions to documentation
#29736 opened Nov 29, 2025 by ksenthilnathan02 Loading…
Fix AttributeError about _use_fi_prefill v1
#29734 opened Nov 29, 2025 by hl475 Loading…
5 tasks
[Quantization] Enable compressed-tensors AWQ for Turing GPU ready ONLY add when PR is ready to merge/full CI is needed
#29732 opened Nov 29, 2025 by Isotr0py Loading…
1 of 5 tasks
[Doc]: Fix typo in fused_moe layer
#29731 opened Nov 29, 2025 by BowTen Loading…
1 of 5 tasks
[Misc] Update TokenizerLike interface and move get_cached_tokenizer ci/build documentation Improvements or additions to documentation frontend ready ONLY add when PR is ready to merge/full CI is needed v1
#29730 opened Nov 29, 2025 by DarkLight1337 Loading…
5 tasks
[Frontend] Add streaming tool-call support to Responses API (non-Harmony) frontend gpt-oss Related to GPT-OSS models
#29726 opened Nov 29, 2025 by sumitaryal Loading…
5 tasks
[WIP][Feat][Sched] Support Balance Scheduling v1
#29721 opened Nov 29, 2025 by GDzhu01 Loading…
5 tasks
add default stop string frontend
#29718 opened Nov 29, 2025 by mhm0902 Loading…
5 tasks
[Bugfix] Suppress non-TTY color output on the process name part of the log
#29714 opened Nov 29, 2025 by a4lg Loading…
1 of 5 tasks
Fix RoPE failures in Transformers nightly
#29700 opened Nov 28, 2025 by hmellor Loading…
FlashInfer-Bench Integration for vLLM documentation Improvements or additions to documentation nvidia v1
#29695 opened Nov 28, 2025 by sfc-gh-goliaro Draft
4 of 11 tasks
[Bugfix] Schedule failure due to wrong get_image_size_with_most_features multi-modality Related to multi-modality (#4194) qwen Related to Qwen models ready ONLY add when PR is ready to merge/full CI is needed
#29692 opened Nov 28, 2025 by tomtomjhj Loading…
3 of 5 tasks
[WIP][Kernel]Support W4A8 Grouped GEMM on Hopper ci/build new-model Requests to new models nvidia
#29691 opened Nov 28, 2025 by czhu-cohere Loading…
5 tasks
[CI] Renovation of nightly wheel build & generation ci/build
#29690 opened Nov 28, 2025 by Harry-Chen Loading…
3 of 5 tasks
[Chore]: Remove Olmo3 and FlexOlmo config copy ready ONLY add when PR is ready to merge/full CI is needed
#29677 opened Nov 28, 2025 by Isotr0py Loading…
1 of 5 tasks
[NIXL] Add remote_request_id to kv_transfer_params kv-connector ready ONLY add when PR is ready to merge/full CI is needed v1
#29665 opened Nov 28, 2025 by markmc Loading…
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.