-
-
Notifications
You must be signed in to change notification settings - Fork 10.9k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Spec Decode] Fix EAGLE + DP bug
speculative-decoding
v1
#27837
opened Oct 30, 2025 by
MatthewBonanni
Loading…
3 of 5 tasks
[CI/Build] Remove dashes for comments
ci/build
#27835
opened Oct 30, 2025 by
amdfaa
Loading…
5 tasks
[Kimi-Linear] Correct prefixes and add compatibility to AWQ quants
#27834
opened Oct 30, 2025 by
toncao
Loading…
4 tasks
[Bugfix][Multimodal][Torch Compile] Avoid compiling the same module definition many times
#27833
opened Oct 30, 2025 by
Lucaskabela
•
Draft
3 of 5 tasks
[Misc] Make all tool scripts executable
ready
ONLY add when PR is ready to merge/full CI is needed
#27831
opened Oct 30, 2025 by
MatthewBonanni
Loading…
5 tasks
[Test] Adjust abort sleep time to reduce AsyncLLM test flake
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#27827
opened Oct 30, 2025 by
njhill
Loading…
[Cleanup] Remove no-longer-used ONLY add when PR is ready to merge/full CI is needed
SpeculativeConfig.enable_chunked_prefill
frontend
ready
#27826
opened Oct 30, 2025 by
njhill
Loading…
Docs update tpu install instructions
documentation
Improvements or additions to documentation
tpu
Related to Google TPUs
#27824
opened Oct 30, 2025 by
RobMulla
Loading…
4 of 5 tasks
Simplify vLLM deployment on AWS with new Ansible playbooks and step-by-step instructions & video guide
documentation
Improvements or additions to documentation
#27820
opened Oct 30, 2025 by
rlopez133
Loading…
2 tasks
[MLA] Separate Quant from unified_mla_attn op
v1
#27817
opened Oct 30, 2025 by
pavanimajety
•
Draft
5 tasks
[Misc] Refactor Attention kv transfer methods into decorator
#27816
opened Oct 30, 2025 by
NickLucche
Loading…
[BugFix] fix: skip check unstreamed tool arg tokens when tool call name is present
frontend
#27806
opened Oct 30, 2025 by
llsj14
Loading…
5 tasks
[Bugfix] Improve KV events subscriber with better event handling
documentation
Improvements or additions to documentation
#27803
opened Oct 30, 2025 by
Bevisy
Loading…
5 tasks
fix: skip AWQ-Marlin on ROCm in check_moe_marlin_supports_layer
ci/build
rocm
Related to AMD ROCm
#27801
opened Oct 30, 2025 by
yuttian1
Loading…
[V0 deprecation] Remove VLLM_USE_V1 usage in platform and v1 module
ready
ONLY add when PR is ready to merge/full CI is needed
rocm
Related to AMD ROCm
tpu
Related to Google TPUs
v1
#27798
opened Oct 30, 2025 by
wangxiyuan
Loading…
5 tasks
[Core][Perf] Replace isinstance(CrossAttentionManager) with a class member variable
v1
#27797
opened Oct 30, 2025 by
Jialin
Loading…
3 of 5 tasks
V0.11.0
ci/build
deepseek
Related to DeepSeek models
documentation
Improvements or additions to documentation
frontend
gpt-oss
Related to GPT-OSS models
kv-connector
llama
Related to Llama models
multi-modality
Related to multi-modality (#4194)
needs-rebase
new-model
Requests to new models
performance
Performance-related issues
qwen
Related to Qwen models
rocm
Related to AMD ROCm
speculative-decoding
tpu
Related to Google TPUs
v1
#27796
opened Oct 30, 2025 by
sidikbro
Loading…
[Core][Observability] Add KV cache residency metrics
v1
#27793
opened Oct 30, 2025 by
shivampr
Loading…
[Chore] eliminate duplicated and unconditional object serialization in anthropic messages api
frontend
#27792
opened Oct 30, 2025 by
vicoooo26
Loading…
5 tasks
[Bugfix] [CPU] bump torch to 2.9.0 for Darwin to fix segmentation fault
ci/build
#27791
opened Oct 30, 2025 by
kebe7jun
Loading…
3 of 5 tasks
[XPU] Add gpt-oss model support for Intel GPU
gpt-oss
Related to GPT-OSS models
v1
#27786
opened Oct 30, 2025 by
jikunshang
Loading…
5 tasks
Previous Next
ProTip!
Follow long discussions with comments:>50.