-
-
Notifications
You must be signed in to change notification settings - Fork 5.8k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[CI/Build] Remove limitation of NVCC_THREADS and MAX_JOBS > CPU count
ci/build
#13606
opened Feb 20, 2025 by
paul-grundmann
Loading…
[Kernel][Minor] Refactor macro parameter naming for consistency
#13605
opened Feb 20, 2025 by
haochengxia
Loading…
[Misc] Upgrade Improvements or additions to documentation
transformers
to 4.49
ci/build
documentation
#13602
opened Feb 20, 2025 by
ywang96
Loading…
[Platform] Refactor memory manage function in memory_profiling to Platform
#13599
opened Feb 20, 2025 by
ji-huazhong
Loading…
[V1][Minor] Print KV cache size in token counts
v1
#13596
opened Feb 20, 2025 by
WoosukKwon
Loading…
[MISC] optimizing _calc_mrope_positions
needs-rebase
v1
#13595
opened Feb 20, 2025 by
kevin-can
Loading…
[Bugfix] V1 Memory Profiling: V0 Sampler Integration without Rejection Sampler
v1
#13594
opened Feb 20, 2025 by
JenZhao
Loading…
Bump up the transformers version to v4.49.0
ci/build
documentation
Improvements or additions to documentation
ready
ONLY add when PR is ready to merge/full CI is needed
#13592
opened Feb 20, 2025 by
WoosukKwon
Loading…
[core] set up data parallel communication
ci/build
v1
#13591
opened Feb 20, 2025 by
youkaichao
•
Draft
[V1][Sampler] Avoid an operation during temperature application
v1
#13587
opened Feb 20, 2025 by
njhill
Loading…
[Bugfix][CPU] Fix cpu all-reduce using native pytorch implementation
ready
ONLY add when PR is ready to merge/full CI is needed
#13586
opened Feb 20, 2025 by
Isotr0py
Loading…
[Model] Merged multimodal processor for Paligemma
needs-rebase
#13584
opened Feb 20, 2025 by
kylehh
Loading…
Switching test jobs into the amd_mi300 queue. // EXPERIMENTATION NO NEED TO MERGE
ci/build
#13576
opened Feb 20, 2025 by
Alexei-V-Ivanov-AMD
Loading…
[Bugfix] Change disaggregated prefill example model name reflecting the change on HF
#13573
opened Feb 20, 2025 by
TristonC
Loading…
[HTTP Server] Make model param optional in request
frontend
#13568
opened Feb 19, 2025 by
youngkent
Loading…
Remove unused kwargs from model definitions
documentation
Improvements or additions to documentation
ready
ONLY add when PR is ready to merge/full CI is needed
speculative-decoding
v1
#13555
opened Feb 19, 2025 by
hmellor
Loading…
fix(chunked prefill): don't schedule prefill if freeing kv cache
#13539
opened Feb 19, 2025 by
toslunar
Loading…
[Frontend] Add backend-specific options for guided decoding
documentation
Improvements or additions to documentation
ready
ONLY add when PR is ready to merge/full CI is needed
structured-output
#13505
opened Feb 19, 2025 by
joerunde
Loading…
[V1][Metrics] Implement vllm:lora_requests_info metric
v1
#13504
opened Feb 18, 2025 by
markmc
Loading…
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.