Fix packet loss when deploy little model #2548

sdli1995 · 2024-12-23T01:17:54Z

fix packet loss when deploy small llm like qwen 0.5B or fishauido

Motivation

We found that when deploying some small LLMs, SGlang experiences packet loss under high load because the scheduler's results return too quickly, and at the same time, the rid is not updated in rid_to_state

Modifications

we move send_to_scheduler after rid_to_state update

Checklist

Format your code according to the Contributor Guide.
Add unit tests as outlined in the Contributor Guide.
Update documentation as needed, including docstrings or example tutorials.

0.5B

merrymercy · 2024-12-28T22:09:28Z

@sdli1995 Can you fix the lint and CI errors?
https://sgl-project.github.io/references/contributor_guide.html#format-your-code

merrymercy · 2025-01-02T23:59:47Z

move to #2717

fix packet loss when deploy little model like qwen

c68259c

0.5B

merrymercy added the high priority label Dec 26, 2024

merrymercy self-assigned this Dec 26, 2024

Astrali added 2 commits December 31, 2024 17:40

lint code

d4c281d

Merge branch 'main' into fix_packet_loss

48123ce

merrymercy marked this pull request as ready for review January 2, 2025 23:33

merrymercy requested review from merrymercy, Ying1123 and hnyls2002 as code owners January 2, 2025 23:33

Merge branch 'main' into fix_packet_loss

d6b5d6e

merrymercy mentioned this pull request Jan 2, 2025

Fix package loss for small models #2717

Merged

merrymercy closed this Jan 2, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix packet loss when deploy little model #2548

Fix packet loss when deploy little model #2548

sdli1995 commented Dec 23, 2024

merrymercy commented Dec 28, 2024 •

edited

Loading

merrymercy commented Jan 2, 2025

Fix packet loss when deploy little model #2548

Fix packet loss when deploy little model #2548

Conversation

sdli1995 commented Dec 23, 2024

Motivation

Modifications

Checklist

merrymercy commented Dec 28, 2024 • edited Loading

merrymercy commented Jan 2, 2025

merrymercy commented Dec 28, 2024 •

edited

Loading