Skip to content

Commit ddf7bce

Browse files
authored
Upgrade to CUTLASS 4.2.1 (#18372)
* Upgrade to CUTLASS 4.2.1 * Fix test: mbarrier.try_wait requires .target sm_90 or higher
1 parent 2fd72ab commit ddf7bce

File tree

3 files changed

+3
-4
lines changed

3 files changed

+3
-4
lines changed

3rdparty/cutlass

Submodule cutlass updated 221 files

tests/python/tir-base/test_tir_ptx_cp_async.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -95,7 +95,7 @@ def ptx_cp_async_barrier(
9595
B[tx, i] = A_shared[tx, i]
9696

9797

98-
@pytest.mark.xfail(reason="temp skip test due to cuda env update")
98+
@tvm.testing.requires_cuda_compute_version(9)
9999
def test_ptx_cp_async_barrier():
100100
f = ptx_cp_async_barrier
101101

tests/python/tir-transform/test_tir_transform_inject_ptx_async_copy.py

Lines changed: 1 addition & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -214,8 +214,7 @@ def ptx_global_to_shared_copy_fp32x1_barrier(
214214
B[tx, i] = A_shared[tx, i]
215215

216216

217-
@pytest.mark.xfail(reason="temp skip test due to cuda env update")
218-
@tvm.testing.requires_cuda
217+
@tvm.testing.requires_cuda_compute_version(9)
219218
def test_inject_async_copy_barrier():
220219
dtype = "float32"
221220
vec_size = 1

0 commit comments

Comments
 (0)