-
Notifications
You must be signed in to change notification settings - Fork 1.9k
[TRTLLM-4629] [feat] Add support of CUDA13 and sm103 devices #7568
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from 95 commits
Commits
Show all changes
108 commits
Select commit
Hold shift + click to select a range
303604f
upgrade to base image and new TRT, fix many dependency issues
VALLIS-NERIA 5c09dc8
CUDA13 breaking changes: c++ compile successful
VALLIS-NERIA 1b84604
fix kernel select code to recognize sm103/sm100f
VALLIS-NERIA 3a94d80
Update SM100f cubins
Tom-Zheng 469a38d
feat: Add support for SM103 3xFP4 tile shapes
djns99 52ad443
disable 3xfp4
VALLIS-NERIA 345c2bc
update trtllm-gen sm100f cubins of gemm kernels
VALLIS-NERIA e27cbb5
Ampere moe kernel should build to all arch
VALLIS-NERIA 78a55b8
fix vicuna dependency
VALLIS-NERIA 271916d
fix deep_gemm & CUDA13
VALLIS-NERIA 886437d
merge existing env fix
VALLIS-NERIA b782b6e
fix sm check of kv reuse and chunked context
VALLIS-NERIA 84f96b4
update triton and fix deepgemm pip
VALLIS-NERIA 759e7a0
Merge remote-tracking branch 'gitlab/main' into feat/gb110_bringup
VALLIS-NERIA bee1df9
remove deepgemm war
VALLIS-NERIA 97a3788
update triton image
VALLIS-NERIA ebec4ea
infra: upgrade to DLFW 25.08-pre and TRT 10.13.2.4
ZhanruiSunCh 36f2e88
Merge branch 'user/zhanruis/update_dlfw_and_cu13' into 'feat/b300_cu13'
ZhanruiSunCh 0bf6a18
Fix and waive to clean L0
VALLIS-NERIA f12a90b
Merge branch 'feat/gb110_bringup' into 'feat/b300_cu13'
VALLIS-NERIA 8c99853
infra: Support build for both CU12 and CU13
ZhanruiSunCh c1014e8
Merge branch 'user/zhanruis/update_dlfw_and_cu13_2' into 'feat/b300_c…
ZhanruiSunCh 4a95d88
revert tlg kernels for ease of merge
VALLIS-NERIA 8b53236
Merge remote-tracking branch 'gitlab/main' into user/xiweny/merge_mai…
VALLIS-NERIA 5391191
update tg cubins (temp ver)
VALLIS-NERIA f4de884
Merge remote-tracking branch 'gitlab/main' into user/xiweny/merge_mai…
VALLIS-NERIA b7cc06c
disable merge waive list stage
VALLIS-NERIA fa8b52e
fix more sm version check
VALLIS-NERIA 808059d
Merge remote-tracking branch 'gitlab/main' into user/xiweny/merge_mai…
VALLIS-NERIA 90a9bc4
fix build error
VALLIS-NERIA 80ea062
fix cubins
VALLIS-NERIA 66b1d8d
Update flashinfer
VALLIS-NERIA ab7febd
Merge commit '31979aefacbf80d2742c98ef30385db162788c84' into feat/b30…
VALLIS-NERIA b1c6f6a
update cutlass and DeepGEMM
VALLIS-NERIA 9ad68de
Merge branch 'user/xiweny/update_cutlass_4.2' into 'feat/b300_cu13'
VALLIS-NERIA ee37589
infra: update DLFW 25.08 GA, triton 25.08 GA
ZhanruiSunCh c2e1ad9
Merge branch 'user/zhanruis/update_dlfw_and_cu13_3' into 'feat/b300_c…
ZhanruiSunCh 6fd765f
[None][fix] fix trtllm moe backend error when running gptoss on b300
jiaganc f14c740
Merge branch 'dev-jiaganc-fix-b300-gptoss-trtllm' into 'feat/b300_cu13'
VALLIS-NERIA 3c06303
[TRTLLM-7755][infra] Add DGX_B300 and GB300 tests in CI
yiqingy0 c425c12
Merge branch 'user/yiqingy/add_b300_tests' into 'feat/b300_cu13'
yiqingy0 0fb835d
fix cutlass moe not falling back
VALLIS-NERIA 8d5a7ea
[https://nvbugs/5443053][fix] Disable finalize fusion when Lora is used
jiaganc 3cc2591
Merge branch 'dev-jiaganc-fix-b300-moe-lora' into 'feat/b300_cu13'
VALLIS-NERIA 3805f61
[https://nvbugs/5453949][infra] unwaive test_llama_eagle3
bo-nv a765ee4
Merge branch 'feat/b300_cu13-latest' into 'feat/b300_cu13'
VALLIS-NERIA 14154ec
disable sm103 moe kernel
VALLIS-NERIA 38ef850
Merge remote-tracking branch 'gitlab/main' into user/xiweny/merge_0901
VALLIS-NERIA 62a7897
Merge remote-tracking branch 'origin/main' into user/xiweny/merge_0901
VALLIS-NERIA 90ce786
Fix arg name in _test_trtllm_serve_multimodal_benchmark.py
VALLIS-NERIA 5bd50d4
update mha cubins and support 103a
VALLIS-NERIA 1978227
Merge branch 'user/xiweny/mha_103' into 'feat/b300_cu13'
VALLIS-NERIA 5ca3376
Support DLFW sanity check use CU13 image
ZhanruiSunCh 9ae01a8
Merge branch 'user/zhanruis/0828_support_cuda_13_for_sanity_check' in…
ZhanruiSunCh 973fd37
add 3xfp4 cutlass gemm
VALLIS-NERIA fcf413e
Merge branch 'user/xiweny/3xfp4_gemm' into 'feat/b300_cu13'
VALLIS-NERIA 5d4f7f4
update flashinfer and waive bug
VALLIS-NERIA 22219bc
Add B300 & GB300 CI
VALLIS-NERIA 2c3f4cb
Merge remote-tracking branch 'origin/main' into feat/b300_cu13
VALLIS-NERIA f8864b9
update trtllm gemm
VALLIS-NERIA cca347e
[TRTLLM-4629] [feat] Step1: trtllm-gen kernels support sm103
VALLIS-NERIA 5e7aa76
Merge branch 'user/sm103_trtllmgen' into feat/b300_cu13
VALLIS-NERIA 10af4f4
[TRTLLM-4629] [feat] Step1: trtllm-gen kernels support sm103
VALLIS-NERIA 1d7979a
fix
VALLIS-NERIA 3e71ec7
Merge branch 'user/sm103_trtllmgen' into feat/b300_cu13
VALLIS-NERIA 65f8478
fix trtllm-gen interface change
VALLIS-NERIA bec1e71
fix
VALLIS-NERIA 0b0781f
fix
VALLIS-NERIA 3d4f49e
fix missing gemm kernels
VALLIS-NERIA 1150def
Merge branch 'user/sm103_trtllmgen' into feat/b300_cu13
VALLIS-NERIA d12eb4b
fix CI build archs
VALLIS-NERIA 322db71
Merge remote-tracking branch 'origin/main' into feat/b300_cu13
VALLIS-NERIA 8f8766a
waive
VALLIS-NERIA 2912908
Merge remote-tracking branch 'origin/main' into feat/b300_cu13
VALLIS-NERIA e6bb1fe
remove non-exist cases
VALLIS-NERIA 77657de
fix build args
VALLIS-NERIA d42201e
remove waivers and cleanup
VALLIS-NERIA caea58a
increase build memory
VALLIS-NERIA d4d9e77
reset build memory
VALLIS-NERIA 019b1db
fix 5505835
VALLIS-NERIA fdaf4e2
Merge remote-tracking branch 'origin/main' into feat/b300_cu13
VALLIS-NERIA e30e0c8
waive
VALLIS-NERIA 4cf9fed
Merge commit 'ed27a72bcf71f7ab0e7137f7999988c9de82386f' into feat/b30…
VALLIS-NERIA b573e07
[None][infra] Disable CU12 build to save build time (cost > 5 hours o…
ZhanruiSunCh 82833fa
address comments
VALLIS-NERIA 8cc5ea3
add comment
VALLIS-NERIA a8b630f
Merge remote-tracking branch 'origin/main' into feat/b300_cu13
VALLIS-NERIA 2c287d5
don't throw in ctor
VALLIS-NERIA 11d603b
fix
VALLIS-NERIA d16d98c
fix missing change
VALLIS-NERIA 5f508b7
Merge remote-tracking branch 'origin/main' into feat/b300_cu13
VALLIS-NERIA 2e61526
fix
VALLIS-NERIA 0b73a57
refine sm version check
VALLIS-NERIA 27c73de
add a line of comment
VALLIS-NERIA b8d1ee6
exclude sm70
VALLIS-NERIA 6133354
fix sm check
VALLIS-NERIA 41d3cf6
Merge remote-tracking branch 'origin/main' into feat/b300_cu13
VALLIS-NERIA ced6e74
[None][infra] Remove WAR on feat branch (#7642)
ZhanruiSunCh 98cbab0
[None][infra] Update images (#7690)
ZhanruiSunCh 514ebc2
remove sm70 from fmha_v2 completely
VALLIS-NERIA 9bd8df7
Merge remote-tracking branch 'origin/main' into feat/b300_cu13
VALLIS-NERIA ad20048
remove sm72 & 75
VALLIS-NERIA 93195ec
waive
VALLIS-NERIA 98d42f9
Merge remote-tracking branch 'origin/main' into feat/b300_cu13
VALLIS-NERIA cf74f40
fix testdb
VALLIS-NERIA d48e82a
fix testdb
VALLIS-NERIA 7657d83
fix
VALLIS-NERIA 0192299
Merge remote-tracking branch 'origin/main' into feat/b300_cu13
VALLIS-NERIA File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
Submodule DeepGEMM
updated
36 files
VALLIS-NERIA marked this conversation as resolved.
Show resolved
Hide resolved
|
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@Barry-Delaney could you help to check if this DG is what we want? Thx
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The DG version seems fine.
I tried compiling locally for verification, but the build failed here.
Once it got fixed, I can double check on this.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I just fixed it in latest commit.