Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
50 commits
Select commit Hold shift + click to select a range
2440764
Init
jeejeelee Mar 12, 2026
4e1712b
Backup
jeejeelee Mar 13, 2026
2788b44
Generation LGTM
jeejeelee Mar 14, 2026
10323a9
Backup
jeejeelee Mar 14, 2026
faac03c
Support torch.compile
jeejeelee Mar 14, 2026
1683c3d
cleanup code
jeejeelee Mar 14, 2026
7ba290a
Add testing
jeejeelee Mar 14, 2026
69f231c
Fix
jeejeelee Mar 14, 2026
6ac265d
Merge branch 'main' into minmax-m2-norm-trtllm-kernel
jeejeelee Mar 15, 2026
d1dce0b
Cleanup
jeejeelee Mar 15, 2026
8112fbc
Merge branch 'main' into minmax-m2-norm-trtllm-kernel
jeejeelee Mar 16, 2026
db53487
Modify logic
jeejeelee Mar 17, 2026
83f6d1a
Skip qk copy
jeejeelee Mar 18, 2026
dc9d427
Modify kernel input
jeejeelee Mar 19, 2026
9066f23
Merge branch 'main' into minmax-m2-norm-trtllm-kernel
jeejeelee Mar 19, 2026
8cec181
Move
jeejeelee Mar 19, 2026
917de77
Cleanup
jeejeelee Mar 20, 2026
f435537
Move
jeejeelee Mar 20, 2026
ae3661e
Merge branch 'main' into minmax-m2-norm-trtllm-kernel
jeejeelee Mar 20, 2026
e30277b
Move
jeejeelee Mar 21, 2026
83ec039
Merge branch 'main' into minmax-m2-norm-trtllm-kernel
jeejeelee Mar 23, 2026
9c53b60
Merge branch 'main' into minmax-m2-norm-trtllm-kernel
jeejeelee Mar 24, 2026
2556d52
FIX
jeejeelee Mar 26, 2026
2614b47
Merge branch 'main' into minmax-m2-norm-trtllm-kernel
jeejeelee Mar 26, 2026
a547d68
Merge branch 'main' into minmax-m2-norm-trtllm-kernel
jeejeelee Mar 26, 2026
8b89488
Merge branch 'main' into minmax-m2-norm-trtllm-kernel
jeejeelee Mar 28, 2026
dea4641
Merge branch 'main' into minmax-m2-norm-trtllm-kernel
jeejeelee Mar 29, 2026
62a5f66
Merge branch 'main' into minmax-m2-norm-trtllm-kernel
jeejeelee Mar 30, 2026
cd4d78f
Copy latest imp
jeejeelee Mar 29, 2026
16d6544
Move
jeejeelee Mar 31, 2026
a3f01e7
Add custom op
jeejeelee Apr 2, 2026
7e6aeb7
move
jeejeelee Apr 2, 2026
1822e54
move
jeejeelee Apr 2, 2026
68b74c0
Kernel fusion
jeejeelee Apr 3, 2026
cff5dcb
Kernel fusion done
jeejeelee Apr 3, 2026
9efeba5
Merge remote-tracking branch 'origin/main' into minmax-m2-norm-trtllm…
jeejeelee Apr 4, 2026
b98a1e1
Fix
jeejeelee Apr 4, 2026
fee9e11
Backup
jeejeelee Apr 7, 2026
db28d8d
Delete redundant code
jeejeelee Apr 7, 2026
4bd3ead
Fix test
jeejeelee Apr 7, 2026
2e63a80
FIX
jeejeelee Apr 7, 2026
c9328b2
Merge branch 'main' into minmax-m2-norm-trtllm-kernel
jeejeelee Apr 7, 2026
efa9956
Revert model
jeejeelee Apr 7, 2026
1eaca8a
Fix
jeejeelee Apr 8, 2026
741a736
Fix
jeejeelee Apr 8, 2026
685f22c
Fix
jeejeelee Apr 9, 2026
9c225e1
Merge branch 'main' into minmax-m2-norm-trtllm-kernel
jeejeelee Apr 9, 2026
c9e8c3f
Merge branch 'main' into minmax-m2-norm-trtllm-kernel
jeejeelee Apr 10, 2026
0b91792
Fix
jeejeelee Apr 9, 2026
8c9e80e
FIX
jeejeelee Apr 10, 2026
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
15 changes: 14 additions & 1 deletion .buildkite/test_areas/kernels.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -20,7 +20,20 @@ steps:
- tests/kernels/core
- tests/kernels/test_concat_mla_q.py
commands:
- pytest -v -s kernels/core kernels/test_concat_mla_q.py
- pytest -v -s kernels/core --ignore=kernels/core/test_minimax_reduce_rms.py kernels/test_concat_mla_q.py

- label: Kernels MiniMax Reduce RMS Test (2 GPUs)
timeout_in_minutes: 15
num_devices: 2
device: h100
source_file_dependencies:
- csrc/minimax_reduce_rms_kernel.cu
- csrc/minimax_reduce_rms_kernel.h
- vllm/model_executor/layers/mamba/linear_attn.py
- vllm/model_executor/layers/mamba/lamport_workspace.py
- tests/kernels/core/test_minimax_reduce_rms.py
commands:
- pytest -v -s kernels/core/test_minimax_reduce_rms.py

- label: Kernels Attention Test %N
timeout_in_minutes: 35
Expand Down
2 changes: 2 additions & 0 deletions CMakeLists.txt
Original file line number Diff line number Diff line change
Expand Up @@ -307,6 +307,8 @@ set(VLLM_EXT_SRC
"csrc/torch_bindings.cpp")

if(VLLM_GPU_LANG STREQUAL "CUDA")
list(APPEND VLLM_EXT_SRC "csrc/minimax_reduce_rms_kernel.cu")

SET(CUTLASS_ENABLE_HEADERS_ONLY ON CACHE BOOL "Enable only the header library")

# Set CUTLASS_REVISION. Used for FetchContent. Also fixes some bogus messages when building.
Expand Down
Loading
Loading