Skip to content

Some questions about TTFT and TPOT benchmarks #1229

Closed Answered by merrymercy
sitabulaixizawaluduo asked this question in Q&A
Discussion options

You must be logged in to vote

When the streaming is enabled, sglang won't use num_continue_decode_steps, according to this

if self.out_pyobjs and self.running_batch.has_stream:

These configs are set with some simple heuristic so you can play with these arguments for your own workloads.

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by merrymercy
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants
Converted from issue

This discussion was converted from issue #1228 on August 27, 2024 09:02.