feat(cutedsl): add CuTeDSL backend #1421
Merged
+6,558
−103
64 commits
5617be1 feat: CuTeDSL backend (lucifer1004)
98fd67a fix: clang-tidy (lucifer1004)
1a94479 fix: clang-format (lucifer1004)
cdd8b61 fix: ci (lucifer1004)
f58ca73 fix: revert example gemm fp8 (lucifer1004)
4154b79 fix: remove duplicate code (lucifer1004)
2e9c5e4 fix: switch-case (lucifer1004)
c09af46 fix: fp16 silence (lucifer1004)
375a389 fix: TVM IR print (lucifer1004)
6e09ec5 fix: useless tir (lucifer1004)
9e5355b fix: clang-format (lucifer1004)
c7319c5 fix: remove tilelang/contrib/cutedsl/.gitignore (lucifer1004)
f3362bc fix: use hexfloat (lucifer1004)
7d4cef6 fix: gsym guard (lucifer1004)
702c1b1 fix: unknown storage sync type (lucifer1004)
1447851 fix: string literal (lucifer1004)
35b7a2a fix: add args guard (lucifer1004)
4ca27bc fix: name hint dedup (lucifer1004)
1680ed5 fix: better find_kernel_by_pattern (lucifer1004)
76354cb fix: set libpath for from_database path (lucifer1004)
081de86 fix: guard buffer.strides (lucifer1004)
5252482 fix: from guard (lucifer1004)
4238d9e fix: eviction guard (lucifer1004)
45fc89a fix: use thread local tma descs (lucifer1004)
c510459 fix: ruff (lucifer1004)
a5a721e fix: drop tma_init_cpp (lucifer1004)
1465ff9 fix: exc_info (lucifer1004)
51bdb1f fix: negative unmatch early return (lucifer1004)
e6efdd9 fix: rename postproc func and add test (lucifer1004)
9246673 fix: handle fast math according to pass config (lucifer1004)
25c2d02 fix: dyn_sym parse (lucifer1004)
cf01c04 fix: wrap_forward (lucifer1004)
b6987f5 fix: use tvm_ffi.libinfo instead of cli (lucifer1004)
b317a03 fix: keep signature (lucifer1004)
b4fbe8e fix: C++ string safety (lucifer1004)
17b0696 fix: mark tma_store_add as unsupported (lucifer1004)
6ea6c8e fix: tvm version (lucifer1004)
402ca6a resolve ldsm and cpasync issues. (cherichy)
c98a080 fix: minor fixes (lucifer1004)
2b16bef fix: parse signature using ast (lucifer1004)
1a5139f fix: guard global_addr (lucifer1004)
b22a82c fix: create tempfile only when necessary (lucifer1004)
7a4bcc8 fix: use logger.execption for exceptions (lucifer1004)
336d387 fix: guard lib_path and host_func (lucifer1004)
f3d8029 fix: remove tma_cpp_init and add timeout for cpp compile (lucifer1004)
9866d88 add timeout for mbarrier_wait. (cherichy)
95e5616 fix: _load_kernel_from_disk signature (lucifer1004)
4533c34 resolve codegen issues. (cherichy)
530ca5e fix: logger.exception (lucifer1004)
f196e37 add comment for div_by=1
f1eb843 merge (lucifer1004)
fbe1cf3 fix: reserve cutlass,cute,tl (lucifer1004)
82a2968 fix: guard tma_store (lucifer1004)
8a1011e fix: allow int64 offset in make_tensor_at_offset (lucifer1004)
b777c99 fix: guard barrier (lucifer1004)
9021d8c fix: add comments for div_by=16 (lucifer1004)
fd15d08 fix: div_by=1 issue (lucifer1004)
a2f87b3 delete div_by when offset is 0
f996239 use tl.make_tensor when offset is 0
ede17db fix: explicitly check cutedsl target (lucifer1004)
7500841 Merge remote-tracking branch 'upstream/main' into github-cutedsl (lucifer1004)
32fe9b4 fix: use param.torch_dtype() (lucifer1004)
0575f11 Merge branch 'main' into github-cutedsl (lucifer1004)
043ba95 Merge branch 'main' of https://github.com/tile-ai/tilelang into githu… (LeiWang1999)