Skip to content

Pull requests: InternLM/lmdeploy

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

fix: prevent prefill starvation under high decode load
#4532 opened Apr 16, 2026 by grimoire Collaborator Loading…
Mixed modality
#4531 opened Apr 16, 2026 by CUHKSZzxy Collaborator Loading…
optimize get_sorted_idx in moe
#4529 opened Apr 15, 2026 by grimoire Collaborator Loading…
Test: update video sleep/wakeup and abort scenarios
#4528 opened Apr 15, 2026 by littlegy Contributor Loading…
style: add autopep8 pre-commit hook and apply PEP 8 formatting fixes
#4524 opened Apr 14, 2026 by windreamer Collaborator Loading…
[WIP]: Fix mtp experts
#4520 opened Apr 13, 2026 by RunningLeon Collaborator Loading…
fix qwen3.5 shared_expert_all_reduce
#4515 opened Apr 10, 2026 by yao-fengchen Collaborator Draft
make fp8 model quantized by llm-compressor can be inferenced in turbomind enhancement New feature or request
#4509 opened Apr 8, 2026 by 43758726 Collaborator Loading…
support more message item types
#4501 opened Apr 7, 2026 by CUHKSZzxy Collaborator Draft
fix: handle missing KV cache without crashing engine Bug:P0
#4497 opened Apr 4, 2026 by lvhan028 Collaborator Loading…
Integrate deep-ep nccl backend enhancement New feature or request
#4477 opened Mar 27, 2026 by irexyc Collaborator Loading…
feat: Turbomind linear gdn prefix caching enhancement New feature or request
#4465 opened Mar 25, 2026 by lapy Contributor Loading…
refactor get_ppl improvement
#4461 opened Mar 25, 2026 by lvhan028 Collaborator Loading…
feat: implement Turbomind vision encoder support for Qwen3VL/3.5 families enhancement New feature or request
#4460 opened Mar 24, 2026 by lapy Contributor Loading…
Support multi stop words improvement
#4454 opened Mar 24, 2026 by lvhan028 Collaborator Loading…
[WIP] Support qwen3-omni
#4411 opened Mar 13, 2026 by CUHKSZzxy Collaborator Draft
2 of 4 tasks
Fix Structured Output for GPT-OSS Models
#4386 opened Mar 2, 2026 by windreamer Collaborator Loading…
Improve proxy server improvement
#4354 opened Feb 12, 2026 by lvhan028 Collaborator Loading…
ProTip! What’s not been updated in a month: updated:<2026-03-17.