-
Notifications
You must be signed in to change notification settings - Fork 39
Pull requests: vllm-project/tpu-inference
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Spec Decoding][Bugfix] Use draft_config properly to support other models
#1142
opened Nov 20, 2025 by
py4
Loading…
[MISC] Removed problematic local path for CONFTEST_DIR
#1141
opened Nov 20, 2025 by
JiriesKaileh
Loading…
Use FP8_e5m2 automatically when using quantized kv cache FP8 on trillium
#1136
opened Nov 20, 2025 by
zixi-qi
Loading…
[TPU Offload] Separate offload manager and cpu-cache backend, and code structure refactor
#1122
opened Nov 18, 2025 by
juncgu-google
Loading…
Enable Pipeline Parallelism on Jax models
#1077
opened Nov 12, 2025 by
Chenyaaang
Loading…
1 of 8 tasks
Enable Pipeline Parallelism on Jax runner
#1053
opened Nov 8, 2025 by
Chenyaaang
Loading…
1 of 8 tasks
[Docs] fix dead links in multiple documentation pages
#1027
opened Nov 6, 2025 by
mattheliu
Loading…
3 tasks done
[FIX] Add dummy get_input_embeddings to fix vLLM model type check
#971
opened Oct 29, 2025 by
kuafou
Loading…
Previous Next
ProTip!
Filter pull requests by the default branch with base:main.