Issues: vllm-project/aibrix
[Ecosystem] NIM inference with AIBrix
kind/support
Categorizes issue as a support question.
#735
opened Feb 24, 2025 by
gaocegege
[Question] How to access the vLLM-Vineyard integration code mentioned in Distributed KV Cache documentation?
#733
opened Feb 24, 2025 by
cheyang
[router] Document supported APIs
area/website
kind/documentation
Improvements or additions to documentation
Improving benchmarking token counting accuracy
area/heterogeneous
#726
opened Feb 21, 2025 by
nwangfw
Improving benchmarking scripts with real prompts in heterogeneous GPU story
area/heterogeneous
kind/bug
Something isn't working
#722
opened Feb 20, 2025 by
nwangfw
[Discussion][Umbrella] ModelAdapter Issues
area/lora
kind/enhancement
New feature or request
priority/critical-urgent
Highest priority. Must be actively worked on as someone's top priority right now.
#700
opened Feb 18, 2025 by
kerthcet
1 of 5 tasks
Research & Industry Collaboration Invitation
priority/important-longterm
Important over the long term, but may not be staffed and/or may need multiple releases to complete.
#699
opened Feb 18, 2025 by
Jeffwan
v0.3.0 roadmap
kind/enhancement
New feature or request
kind/feature
Categorizes issue or PR as related to a new feature.
priority/critical-urgent
Highest priority. Must be actively worked on as someone's top priority right now.
Failed to connect to vineyard via both IPC and RPC connection
area/kv-cache
kind/bug
Something isn't working
priority/critical-urgent
Highest priority. Must be actively worked on as someone's top priority right now.
#696
opened Feb 18, 2025 by
Jeffwan
MountVolume.SetUp failed for volume "kube-api-access-wfmh2" no space left on device
area/kv-cache
kind/bug
Something isn't working
priority/critical-urgent
Highest priority. Must be actively worked on as someone's top priority right now.
Failed to download artifacts in installation-test CI job
area/cicd
priority/important-soon
Must be staffed and worked on either currently, or very soon, ideally in time for the next release.
#690
opened Feb 17, 2025 by
Jeffwan
Examples should come with health and readiness checks
good first issue
Good for newcomers
help wanted
Extra attention is needed
kind/documentation
Improvements or additions to documentation
kind/support
Categorizes issue as a support question.
priority/important-soon
Must be staffed and worked on either currently, or very soon, ideally in time for the next release.
#685
opened Feb 16, 2025 by
Jeffwan
controller manager crashes - need more investigation
area/installation
priority/important-longterm
Important over the long term, but may not be staffed and/or may need multiple releases to complete.
triage/needs-information
Indicates an issue needs more information in order to work on it.
#684
opened Feb 16, 2025 by
Jeffwan
redis is not that stable and quit from SIGTERM
area/installation
priority/important-soon
Must be staffed and worked on either currently, or very soon, ideally in time for the next release.
triage/needs-information
Indicates an issue needs more information in order to work on it.
#683
opened Feb 16, 2025 by
Jeffwan
Supporting varying number of pods in radix-tree cache data structure.
area/gateway
kind/enhancement
New feature or request
priority/important-longterm
Important over the long term, but may not be staffed and/or may need multiple releases to complete.
Prefix cache and load aware routing policy
area/gateway
area/kv-cache
area/performance
area/scheduling
kind/enhancement
New feature or request
kind/feature
Categorizes issue or PR as related to a new feature.
[router] Use string instead of token ids
area/gateway
kind/enhancement
New feature or request
priority/critical-urgent
Highest priority. Must be actively worked on as someone's top priority right now.
#673
opened Feb 14, 2025 by
gaocegege
make sure the request can be random sent to server within 1s instead of batching way
area/benchmark
#667
opened Feb 13, 2025 by
Jeffwan
Improve the autoscaling benchmark scripts
area/benchmark
area/performance
priority/critical-urgent
Highest priority. Must be actively worked on as someone's top priority right now.
#666
opened Feb 13, 2025 by
Jeffwan
10 tasks
Testing AIBrix on Lambda Cloud
kind/documentation
Improvements or additions to documentation
priority/important-soon
Must be staffed and worked on either currently, or very soon, ideally in time for the next release.
#658
opened Feb 12, 2025 by
Jeffwan
v0.2.0-rc.2 failed to pin container images
area/installation
priority/critical-urgent
Highest priority. Must be actively worked on as someone's top priority right now.
#652
opened Feb 12, 2025 by
Jeffwan