Test/test workflow#7

Merged
diptorupd merged 2 commits into amd-integration from test/test-workflow
Jan 26, 2026
Conversation

@diptorupd
Owner

Test workflow

@diptorupd diptorupd merged commit f77bc9d into amd-integration Jan 26, 2026
1 check passed
diptorupd pushed a commit that referenced this pull request Jan 28, 2026
This PR removes the `libtorch` dependency and deletes `test_page.cpp`,
the only unit test that uses libtorch. We also have a pytest for testing
page, and we will use that for validation instead.

Removing the libtorch dependency will speed up Docker builds and
reduce the number of dependencies we pull in.


```
Test project /root/flashinfer/libflashinfer/tests/hip/build
    Start 1: MathTest
1/8 Test #1: MathTest ............................   Passed    0.31 sec
    Start 2: PosEncTest
2/8 Test #2: PosEncTest ..........................   Passed    0.31 sec
    Start 3: CascadeTest
3/8 Test #3: CascadeTest .........................   Passed  1369.12 sec
    Start 4: SingleDecodeTest
4/8 Test #4: SingleDecodeTest ....................   Passed  7726.35 sec
    Start 5: BatchDecodeTest
5/8 Test #5: BatchDecodeTest .....................   Passed  811.61 sec
    Start 6: test_mfma_fp32_16x16x16fp16
6/8 Test #6: test_mfma_fp32_16x16x16fp16 .........   Passed    0.30 sec
    Start 7: test_transpose_4x4_half_registers
7/8 Test #7: test_transpose_4x4_half_registers ...   Passed    0.28 sec
    Start 8: test_rowsum
8/8 Test #8: test_rowsum .........................   Passed    0.27 sec

100% tests passed, 0 tests failed out of 8
```
diptorupd added a commit that referenced this pull request Jan 28, 2026
The mask values passed to `__shfl_xor_sync` in the original CUDA version of `frag_layout_swizzle.cuh` were designed for 32-thread warps. That design made the header incompatible with CDNA3, whose warps (wavefronts) have 64 threads. This PR adds a platform-specific `WARP_FULL_MASK` constant to support both 32-thread and 64-thread warps.
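
The idea in the commit message above can be sketched roughly as follows. This is a minimal, host-compilable illustration and not the code from the commit. The helper `full_mask_for` and the constant `kWarpSize` are hypothetical names. Selecting on `__HIP_PLATFORM_AMD__` assumes every AMD target uses 64-thread warps, which holds for CDNA3 but not for all AMD GPUs. Using a 64-bit type for both cases is also a simplification.

```cpp
#include <cstdint>

// Hypothetical helper: a mask with the low `warp_size` bits set.
// The 64-lane case is handled separately because shifting a 64-bit
// value by 64 is undefined behaviour in C++.
constexpr std::uint64_t full_mask_for(unsigned warp_size) {
  return warp_size >= 64 ? ~std::uint64_t{0}
                         : (std::uint64_t{1} << warp_size) - 1;
}

// Assumption: AMD builds target 64-thread warps (as on CDNA3);
// everything else is treated as a 32-thread-warp platform.
#if defined(__HIP_PLATFORM_AMD__)
constexpr unsigned kWarpSize = 64;
#else
constexpr unsigned kWarpSize = 32;
#endif

// One constant that covers every lane of a warp on the current platform.
constexpr std::uint64_t WARP_FULL_MASK = full_mask_for(kWarpSize);

static_assert(full_mask_for(32) == 0xFFFFFFFFull,
              "32-thread warp: low 32 bits set");
static_assert(full_mask_for(64) == 0xFFFFFFFFFFFFFFFFull,
              "64-thread warp: all 64 bits set");
```

A hard-coded `0xffffffff` names only lanes 0 to 31, so on a 64-thread warp the upper half of the lanes would fall outside the mask. Deriving the mask from the warp size avoids baking that assumption into the header.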
@diptorupd diptorupd deleted the test/test-workflow branch February 3, 2026 23:02