Pull requests: NVIDIA/TensorRT-LLM
Pull requests list
doc: [TRTLLM-6089] Add long sequence document for Feature section
#6575
opened Aug 3, 2025 by
lfr-0531
[None][fix] xqa precision for fp16/bf16 kv cache
Community want to contribute
PRs initiated from Community
#6573
opened Aug 3, 2025 by
Bruce-Lee-LY
[TRTLLM-5563][infra] Move test_rerun.py to script folder
#6571
opened Aug 2, 2025 by
yiqingy0
[None][feat] Add GPTQ Int8 Options for Qwen TRT path
#6568
opened Aug 1, 2025 by
JyChang012
[TRTLLM-6090] doc: Add multimodal part to the feature section
#6567
opened Aug 1, 2025 by
chang-l
[None][doc] Add new doc
Community want to contribute
PRs initiated from Community
#6565
opened Aug 1, 2025 by
jamieliNVIDIA
[TRTLLM-5500][infra] Update CODEOWNERS with new ownership rules for additional paths
#6564
opened Aug 1, 2025 by
venkywonka
•
Draft
Draft: feat: Include attention dp rank info with KV cache events
#6563
opened Aug 1, 2025 by
pcastonguay
[TRTLLM-6069]doc: add trtllm-serve usage for cli reference section
#6562
opened Aug 1, 2025 by
nv-guomingz
[None][doc] Create deployment guide for Llama4 Scout FP8 and NVFP4
#6550
opened Aug 1, 2025 by
chenfeiz0326
[None][feat] improve dataloading for benchmark_dataset by using batch…
#6548
opened Aug 1, 2025 by
zerollzeng
[TRTLLM-4501][feat] AutoTuner tuning config refactor and add tuning for kernel configs.
#6545
opened Aug 1, 2025 by
hyukn