-
-
Notifications
You must be signed in to change notification settings - Fork 5k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Bugfix][V1] Fix test_kv_cache_utils.py
ready
ONLY add when PR is ready to merge/full CI is needed
#11738
opened Jan 4, 2025 by
jeejeelee
Loading…
[Frontend] Improve API Server Error Messages
frontend
#11737
opened Jan 4, 2025 by
robertgshaw2-neuralmagic
•
Draft
[Model] Remove unnecessary weight initialization logic
ready
ONLY add when PR is ready to merge/full CI is needed
#11736
opened Jan 4, 2025 by
DarkLight1337
Loading…
[Bugfix] Fix precision error in LLaVA-NeXT feature size calculation
ready
ONLY add when PR is ready to merge/full CI is needed
#11735
opened Jan 4, 2025 by
DarkLight1337
Loading…
[V1] Support audio language models on V1
documentation
Improvements or additions to documentation
#11733
opened Jan 4, 2025 by
ywang96
Loading…
[Bugfix] Validate lora adapters to avoid crashing server
frontend
#11727
opened Jan 3, 2025 by
joerunde
Loading…
[Ignore] Test multi-modal models extended
documentation
Improvements or additions to documentation
#11722
opened Jan 3, 2025 by
mgoin
Loading…
[Model] LoRA with lm_head and embed_tokens fully trained - 4
#11714
opened Jan 3, 2025 by
sergeykochetkov
Loading…
5 tasks done
[Frontend] Add segments to OpenAI Requests
documentation
Improvements or additions to documentation
frontend
#11713
opened Jan 3, 2025 by
ruediste
Loading…
[Kernel][Triton][AMD] Change default block size for triton_scaled_mm to 128 for 3-5x speedup
#11698
opened Jan 3, 2025 by
rasmith
Loading…
[Hardware][Apple] Native support for macOS Apple Silicon
ci/build
documentation
Improvements or additions to documentation
frontend
#11696
opened Jan 2, 2025 by
wallashss
Loading…
[V1] Add BlockTable class
ready
ONLY add when PR is ready to merge/full CI is needed
#11693
opened Jan 2, 2025 by
WoosukKwon
Loading…
[Frontend] Add split_special_tokens to the Tokenize Endpoint
frontend
#11691
opened Jan 2, 2025 by
ruediste
Loading…
k8s-config: Update the secret to use stringData
documentation
Improvements or additions to documentation
#11679
opened Jan 2, 2025 by
surajssd
Loading…
[torch.compile] Hide KV cache behind torch.compile boundary
#11677
opened Jan 2, 2025 by
heheda12345
•
Draft
[Bugfix][SpecDecode] Adjust Eagle model architecture to align with intended design
ready
ONLY add when PR is ready to merge/full CI is needed
#11672
opened Jan 1, 2025 by
llsj14
Loading…
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.