Pull requests: NVIDIA/TensorRT-LLM
Pull requests list
[https://nvbugs/5945047][fix] [TensorRT-LLM][L0][Post-Merge][main] Test failed:
#13169
opened Apr 17, 2026 by
ziyixiong-nv
Collaborator
2 tasks done
[None][chore] Add Dynamo configs to TRTLLM CI - Disagg - Part 2
#13168
opened Apr 17, 2026 by
brb-nv
Collaborator
1 task done
[None][chore] Add Dynamo configs to TRTLLM CI - Disagg - Part 1
#13167
opened Apr 17, 2026 by
brb-nv
Collaborator
1 task done
[None][fix] Raise clear error when GPT-OSS is used with non-TRTLLM attention backend
Community want to contribute
#13166
opened Apr 17, 2026 by
ssam18
Contributor
[https://nvbugs/6050481][fix] DONT REVIEW CI DEBUG
#13165
opened Apr 17, 2026 by
dongfengy
Collaborator
1 task done
[None][fix] Clean up Triton related tests
#13163
opened Apr 17, 2026 by
Tabrizian
Member
1 task done
[https://nvbugs/6074014][fix] Min-reduce available host memory to ensure that all ranks agree about whether prefetch is enabled
#13161
opened Apr 17, 2026 by
dhansen-nvidia
Collaborator
1 task done
[None][chore] improve gemm perf for nemotron in spark
#13160
opened Apr 17, 2026 by
ttyio
Collaborator
1 task done
[None][chore] Bump version to 1.3.0rc13
#13159
opened Apr 17, 2026 by
VALLIS-NERIA
Collaborator
1 task done
[None][feat] Bala/minimax perf2
#13158
opened Apr 17, 2026 by
bmarimuthu-nv
Collaborator
Draft
1 task
[NVBUG-6086538][fix] suppress misleading skip-softmax FMHA warning in generation
#13157
opened Apr 17, 2026 by
bobboli
Collaborator
[#13125][feat] Make auto_deploy standalone-ready and add package generator
#13155
opened Apr 17, 2026 by
lucaslie
Member
3 of 4 tasks
[Don't Review & Merge][None][chore] Default on trtllm_gen attention backend with host code perf optimization
#13154
opened Apr 17, 2026 by
yihwang-nv
Collaborator
1 task
[None][test] amend for qa weekly core test list
#13153
opened Apr 17, 2026 by
ruodil
Collaborator
1 task done
[None][test] Add doc test
#13152
opened Apr 17, 2026 by
StanleySun639
Collaborator
1 task done
[None][fix] Fix Mamba cache slot leak with MTP speculative decoding
#13151
opened Apr 17, 2026 by
Wanli-Jiang
Collaborator
Draft
1 task done
[TRTLLM-11999][feat] Add GLM-4.7/GLM-5 tool parser
#13150
opened Apr 17, 2026 by
JunyiXu-nv
Collaborator
1 task
[None][perf] reduce @torch.library.custom_op host overhead
#13149
opened Apr 17, 2026 by
luyiyun1021
Collaborator
Draft
1 task done
Afmoe trinity support
Community want to contribute
#13148
opened Apr 17, 2026 by
alyosha-swamy
1 task
[None][test] Waive 1 failed cases for main in QA CI
#13147
opened Apr 17, 2026 by
xinhe-nv
Collaborator
[None][feat] Add multi-node support for VisualGen diffusion workers via torchrun/SLURM
#13140
opened Apr 16, 2026 by
venmugil
Collaborator
1 task done
[None][feat] DO NOT MERGE: Stacked changes for nano nemotron omni
#13138
opened Apr 16, 2026 by
2ez4bz
Collaborator
1 task done