Pull requests: sgl-project/sglang
Pull requests list
feat: support 2 kernels for mixed chunked prefill
#2546
opened Dec 22, 2024 by
chosen-ox
2 tasks
Related to #2505 Updated documentation to clarify Xgrammar features.
#2545
opened Dec 22, 2024 by
shuaills
1 of 3 tasks
Enable Nvidia's ModelOpt fp8 quantized models
#2535
opened Dec 21, 2024 by
Edwardf0t1
3 tasks
[Feature] Support new parameter - EBNF in xgrammar
#2526
opened Dec 19, 2024 by
adarshxs
2 of 3 tasks
adapt custom allreduce for tensorrt llm
high priority
#2511
opened Dec 18, 2024 by
yizhang2077
3 tasks
improve performance by removing use_tensor_core dependency
await-response
#2496
opened Dec 17, 2024 by
bjmsong
3 tasks
[Experimental] Add a gRPC server for completion request
high priority
#2478
opened Dec 13, 2024 by
MrAta
2 of 3 tasks
[FIX] Update EOS from config
await-response
#2475
opened Dec 13, 2024 by
zhengy001
1 of 3 tasks
Add InfiniteBench for long context benchmarking
high priority
#2421
opened Dec 9, 2024 by
iankur
2 of 3 tasks
feat: use cascade attention kernel (single level)
#2101
opened Nov 20, 2024 by
james-p-xu
Draft
1 of 3 tasks