Skip to content

Pull requests: NVIDIA/TensorRT-LLM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Temp
#6578 opened Aug 3, 2025 by netanel-haber Draft
[TRTLLM-6174][feat] Enable FP32 mamba ssm cache
#6574 opened Aug 3, 2025 by shaharmor98 Loading…
[None][fix] xqa precision for fp16/bf16 kv cache Community want to contribute PRs initiated from Community
#6573 opened Aug 3, 2025 by Bruce-Lee-LY Loading…
[None][fix] Fix attention dp log
#6570 opened Aug 2, 2025 by Shunkangz Loading…
[None][doc] Add new doc Community want to contribute PRs initiated from Community
#6565 opened Aug 1, 2025 by jamieliNVIDIA Loading…
draft: measure the first two tokens interval
#6561 opened Aug 1, 2025 by Shixiaowei02 Loading…
[None][chore] Add unit test for Gemma3 lora
#6560 opened Aug 1, 2025 by brb-nv Loading…
[None][infra] fix Build Docker Image tag issue
#6555 opened Aug 1, 2025 by ZhanruiSunCh Loading…
feat: Support custom repo_dir for SLURM script
#6546 opened Aug 1, 2025 by kaiyux Loading…
ProTip! Follow long discussions with comments:>50.