-
Notifications
You must be signed in to change notification settings - Fork 588
Issues: sgl-project/sglang
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
[Feature] (Willing to PR) Avoid KV cache occupying GPU memory when not used
feature
#2542
opened Dec 22, 2024 by
fzyzcjy
2 tasks done
[Bug] Eagle2 has an unstable sampling rate during multi concurrency。
#2537
opened Dec 21, 2024 by
coolhok
5 tasks done
[Bug] Transformers doesn't recognize LLaVA variant architectures
#2532
opened Dec 20, 2024 by
amosyou
5 tasks done
[Bug] RuntimeRrror: Ninja is required to load c++ extensions
#2514
opened Dec 19, 2024 by
Flynn-Zh
2 of 5 tasks
[Feature] Support for Evicting Specific KV Cache to Save GPU Memory
#2510
opened Dec 18, 2024 by
ChenlongDeng
2 tasks done
[Feature] Integration SGLang into OpenRLHF
collaboration
high priority
#2506
opened Dec 17, 2024 by
zhaochenyang20
2 tasks done
[Feature] Add Tutorial for Constraint Decoding
documentation
Improvements or additions to documentation
good first issue
Good for newcomers
#2505
opened Dec 17, 2024 by
zhaochenyang20
2 tasks done
[Feature] Add Math in our CI
enhancement
New feature or request
good first issue
Good for newcomers
#2504
opened Dec 17, 2024 by
zhaochenyang20
2 tasks
[Feature] Benchmarking Performance on General Devices
collaboration
enhancement
New feature or request
#2488
opened Dec 16, 2024 by
zhaochenyang20
2 tasks done
[Feature] request smoothquant (int8, W8A8) quantization on 40G A100
#2474
opened Dec 13, 2024 by
Hao-YunDeng
2 tasks done
[Feature] Integrate CUTLASS FP8 GEMM into sgl-kernel
high priority
performance
quant
LLM Quantization
#2472
opened Dec 12, 2024 by
zhyncs
2 tasks
[Feature] FusedMoE H200 tuning
enhancement
New feature or request
#2471
opened Dec 12, 2024 by
zhyncs
2 tasks
[Bug] Different behavior benchmarking w/ request-range-range vs. separate request-rates
#2470
opened Dec 12, 2024 by
Mutinifni
5 tasks done
[Feature] Do we have any plan for supporting MiniCPM-V 2.6?
collaboration
#2461
opened Dec 12, 2024 by
Xeladoes
2 tasks done
[Feature]: Benchmarking H200
good first issue
Good for newcomers
high priority
#2450
opened Dec 11, 2024 by
antferdom
2 tasks done
Previous Next
ProTip!
Follow long discussions with comments:>50.