-
-
Notifications
You must be signed in to change notification settings - Fork 7.8k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Core] Update error message for Whisper + num-scheduler-steps > 1
ready
ONLY add when PR is ready to merge/full CI is needed
#19286
opened Jun 6, 2025 by
russellb
Loading…
[Bugfix]: Fix TypeError: 'float' object cannot be interpreted as an integer
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#19283
opened Jun 6, 2025 by
chaunceyjiang
Loading…
[Easy][Test] Simplify test_function_tool_use with multiple parametrizes
ready
ONLY add when PR is ready to merge/full CI is needed
#19269
opened Jun 6, 2025 by
houseroad
Loading…
3 tasks done
Convert kv_transfer_config from dict to KVTransferConfig to fix #19259
frontend
#19262
opened Jun 6, 2025 by
maobaolong
Loading…
[New Model]: Support Qwen3 Embedding & Reranker
frontend
#19260
opened Jun 6, 2025 by
noooop
Loading…
[CPU] Fix torch version in x86 CPU backend and refine default configurations
ci/build
multi-modality
Related to multi-modality (#4194)
v1
#19258
opened Jun 6, 2025 by
bigPYJ1151
Loading…
2 of 3 tasks
[Misc] refactor context extension
documentation
Improvements or additions to documentation
#19246
opened Jun 6, 2025 by
reidliu41
Loading…
3 tasks
[RFC] Make max chunk bytes configurable
needs-rebase
v1
#19242
opened Jun 6, 2025 by
jennyyyyzhen
Loading…
Support no privileged mode on CPU for docker and kubernetes deployments
#19241
opened Jun 5, 2025 by
louie-tsai
Loading…
1 of 3 tasks
[Core] Allow vLLM to stream n tokens at a time
frontend
v1
#19240
opened Jun 5, 2025 by
rohingarg-c
Loading…
[Frontend] Simplify and improve error message in tool_choice validation
frontend
#19239
opened Jun 5, 2025 by
22quinn
Loading…
3 tasks done
Use PyTorch util for traced files instead of monkey-patching inline_call()
needs-rebase
#19235
opened Jun 5, 2025 by
jbschlosser
Loading…
[Bugfix] Add padding for block-scale fused-moe weights for AITER lib
#19234
opened Jun 5, 2025 by
qli88
Loading…
[Perf] Optimizations for int8 quant kernels
#19233
opened Jun 5, 2025 by
yewentao256
Loading…
3 tasks done
[Bugfix] Fix Qwen2-Audio chat template for online serving
frontend
#19230
opened Jun 5, 2025 by
Isotr0py
Loading…
1 of 3 tasks
Previous Next
ProTip!
no:milestone will show everything without a milestone.