-
Notifications
You must be signed in to change notification settings - Fork 1.9k
[None][chore] Mass integration of release/1.0 - 3rd #7519
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from all commits
Commits
Show all changes
45 commits
Select commit
Hold shift + click to select a range
450522e
[https://nvbugs/5451028][fix] Constrain NemotronSuper test parameters…
Naveassaf ed70d06
[https://nvbugs/5448579][fix] EXAONE-4.0 accuracy test bugfix (#6888)
yechank-nvidia 06c64a7
[None][chore] Waive E2E GB200 tests for Gemma3 27B (#6916)
brb-nv a60af95
[https://nvbugs/5451296][bug] Fix a thread leak in test_llm_args.py (…
Tabrizian dfbde64
[None][infra] Waive failed tests for release branch (#7036)
EmmaQiaoCh 6ca71f5
[None][doc] add status labels to LLM class's api reference (#6899)
Superjomn aa6603c
[https://nvbugs/5448437][fix] fix some nixl tests (#6940)
bo-nv b33b27a
[https://nvbugs/5427801][fix] Torch compile support for Llama4 and Ea…
liji-nv c1e6126
[https://nvbugs/5394392][fix] Enlarge scheduler capacity under disagg…
yifeizhang-c ede27da
[TRTLLM-7263][fix] Prevent recreation of cublas handles in lora_group…
amitz-nv 835192d
[None][doc] update v1.0 doc for trtllm-serve (#7056)
hchings e7bc4a6
[https://nvbugs/5440241][fix] Fix 70B GSM8K Accuracy drop (#7075)
chenfeiz0326 f8a37bb
[https://nvbugs/5451296][fix] zmq nonblock bug with retry (#7019)
Superjomn 5ff7b61
[https://nvbugs/5383702][fix] test_llm_api_pytorch.py::TestLlama3_1_8…
Superjomn 990a786
[https://nvbugs/5392414] [fix] For release 1.0 cherry pick. Add custo…
ChristinaZ 30f30f7
[https://nvbugs/5464088] [fix] dequantize fp8 activation input to lor…
venkywonka 4b978c8
[None][infra] Skip failed tests for release branch (#7130)
EmmaQiaoCh d49e304
[https://nvbugs/5448442][fix] Skip trtllm moe backend for sm120 (#7010)
pamelap-nvidia 37823b9
[https://nvbugs/5449032][fix] Add more llm-args to llm_mgmn_trtllm_be…
brb-nv f4a8c04
[https://nvbugs/5410391][bug] Support to share device buffers in atte…
HuiGao-NV c2fecf3
[https://nvbugs/5467062][fix] pass logitsPostProcessorBatched by refe…
milesial c128437
[https://nvbugs/5450074][fix] Reduce the device memory requirements f…
Shixiaowei02 5a42ddc
[https://nvbugs/5433545][fix] TestPhi4MiniInstruct::test_auto_dtype -…
moraxu b20ea82
[https://nvbugs/5448426][fix] Fix illegal memory access in cuda graph…
peaceh-nv 633a4d5
[None][fix] Switch llm api quickstart example location per workflow. …
nv-guomingz 2a3e17f
[https://nvbugs/5467232][fix] Fix load_torch_hf_lora to override lora…
Wanli-Jiang 26dbb32
[None][doc] fix tensorrt legacy quickstart page (#7190)
Superjomn 26db89f
[https://nvbugs/5470840][fix] Disaggregated unit test MPI Init handli…
pcastonguay 3854ef1
[None][test] add kv cache size in bench metric and fix failed cases (…
ruodil ce80090
[https://nvbugs/5409416][fix] test_openai_multi_chat_example (#7174)
Linda-Stadter 3af7b1a
[https://nvbugs/5473789][bug] install cuda-toolkit to fix sanity chec…
HuiGao-NV 7b65fd4
[https://nvbugs/5473789][bug] install cuda-toolkit to fix sanity chec…
dominicshanshan 642f622
[None][fix] fix log_once usage (#7210)
yuxianq 9165e67
[None][infra] Waive failed cases for release/1.0 (#7258)
EmmaQiaoCh 31aaed5
[https://nvbugs/5451342][fix] Use runtime max_batch_size when cuda_gr…
jiaganc 1cfb4af
[None][feat] Skip prefetching consolidated safetensors when appropria…
2ez4bz 70197dd
[https://nvbugs/5430125][ci] Unwaive test case for mistral 3.1 small …
2ez4bz e1d8811
[https://nvbugs/5478151][fix] Add missing spec for Llama-3.3 70B (#7267)
brb-nv 413776a
[https://nvbugs/5451426][fix] Avoid torch compile on full eagle3 work…
liji-nv 7ff6f44
[https://nvbugs/5463720][fix] tp-split the inferred `mlp_hidden_size`…
venkywonka cc71861
[https://nvbugs/5480550][fix] Increase timeout for Gemma3 27B test (#…
brb-nv 8797444
[https://nvbugs/5434320][bug] Fix disagg pp bug (#7099)
Tabrizian d279d29
[https://nvbugs/5480415][fix] Fix phi4mm multi-gpu test (#7275)
Wanli-Jiang 2083332
[TRTLLM-7346][fix] Improve performance of PyTorchModelEngine._get_lor…
amitz-nv 74fc47c
[https://nvbugs/5461712] [fix] Disable deep_gemm for Qwen3 due to acc…
DomBrown File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.