-
Notifications
You must be signed in to change notification settings - Fork 2.2k
Pull requests: NVIDIA/TensorRT-LLM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[TRTLLM-11574][feat] Some updates for integrating Pre-Merge Report into CI Report
#12430
opened Mar 22, 2026 by
chenfeiz0326
Loading…
1 task done
[TRTLLM-11506][feat] Integrate Perf Triage Bot into CI Pipeline
#12429
opened Mar 22, 2026 by
chenfeiz0326
Loading…
1 task done
[None][feat] Add SJF (Shortest Job First) waiting queue scheduling policy
#12428
opened Mar 22, 2026 by
lancelly
Loading…
5 of 6 tasks
[None][feat] MLIR-based auto-generated elementwise fusion for AutoDeploy
#12427
opened Mar 22, 2026 by
suyoggupta
Loading…
5 tasks done
[None][fix] Use warnings.warn instead of raise for DeprecationWarning in fla chunk
Community want to contribute
PRs initiated from Community
#12421
opened Mar 21, 2026 by
Bias92
Loading…
3 tasks done
[TRTLLM-12291][feat] New sharding infrastructure
#12419
opened Mar 21, 2026 by
greg-kwasniewski1
Loading…
1 task done
[None][chore] Remove gpu-shell tool from ad-run-agent
#12418
opened Mar 21, 2026 by
govind-ramnarayan
•
Draft
1 task
[TRTLLM-10939][feat] Enable block reuse with overlap scheduler
#12416
opened Mar 20, 2026 by
chienchunhung
•
Draft
1 task done
[TRTLLM-11421][feat] Support better kv cache statistics monitoring
#12413
opened Mar 20, 2026 by
eopXD
Loading…
1 task done
[None][feat] Fuse all_reduce with norm for nemotron_h models
#12410
opened Mar 20, 2026 by
Wanli-Jiang
Loading…
1 task done
[None][test] Fix lora config less than required config number
#12409
opened Mar 20, 2026 by
yufeiwu-nv
Loading…
1 task done
[https://nvbugs/5866619][test] Add unit test for load_state_dict safetensors fallback
#12408
opened Mar 20, 2026 by
crazydemo
Loading…
1 task done
[https://nvbugs/5963665][refactor] Refactor warmup orchestration in M…
#12407
opened Mar 20, 2026 by
liji-nv
Loading…
1 task done
[https://nvbugs/5841976][fix] Remove test_fused_moe_alltoall_fp4[DeepEP] from waives
#12405
opened Mar 20, 2026 by
xxi-nv
Loading…
2 tasks done
[None][fix] Correct reused block counting on corner case
#12404
opened Mar 20, 2026 by
tongyuantongyu
Loading…
1 task done
[https://nvbugs/5916151][fix] Unwaive test_fused_moe_w4a8_nvfp4_fp8[TRTLLM]
#12400
opened Mar 20, 2026 by
xxi-nv
Loading…
2 tasks done
[https://nvbugs/5962591][fix] Fix Triton resmooth kernel crash on SM100f for large MoE grids
#12397
opened Mar 20, 2026 by
Barry-Delaney
Loading…
[None][feat] Temporally-Correlated Heuristic-guided Indexer TopK for Sparse Attention
#12385
opened Mar 20, 2026 by
longcheng-nv
Loading…
5 tasks done
[None][feat] Add NvTelemetry/GXT-compliant usage telemetry
#12384
opened Mar 20, 2026 by
venkywonka
Loading…
9 of 10 tasks
[None][feat] Support MLA in TrtllmGen attention backend
#12383
opened Mar 20, 2026 by
yihwang-nv
•
Draft
1 task done
Previous Next
ProTip!
What’s not been updated in a month: updated:<2026-02-22.