Commit 36ff1f1
authored
File tree
4 files changed
+21
-14
lines changed- 3rdparty
- src/runtime/relax_vm
- tests/python/relax
4 files changed
+21
-14
lines changedSubmodule flashinfer updated 72 files
- .github/workflows/release_wheel.yml+8-1
- .release-please-manifest.json+1-1
- CHANGELOG.md+50-1
- CMakeLists.txt+61-11
- README.md+12-2
- cmake/config.cmake+2
- docs/api/python/cascade.rst+4
- docs/api/python/sampling.rst+3
- docs/conf.py+2-2
- docs/installation.rst+1-1
- docs/tutorials/kv_layout.rst+18
- include/flashinfer/activation.cuh+74
- include/flashinfer/attention/cascade.cuh+85-66
- include/flashinfer/attention/decode.cuh+22-37
- include/flashinfer/attention/handler.cuh+86-79
- include/flashinfer/attention/prefill.cuh+507-368
- include/flashinfer/frag_layout_swizzle.cuh+9-8
- include/flashinfer/mma.cuh+78-2
- include/flashinfer/norm.cuh+115-17
- include/flashinfer/permuted_smem.cuh+62-13
- include/flashinfer/prefill_attention_decl.cuh+19-18
- include/flashinfer/sampling.cuh+504-165
- include/flashinfer/utils.cuh+6
- include/flashinfer/vec_dtypes.cuh+313-235
- python/csrc/activation.cu+60
- python/csrc/batch_decode.cu+25-15
- python/csrc/batch_prefill.cu+222-105
- python/csrc/flashinfer_ops.cu+6-37
- python/csrc/flashinfer_ops.h+33-126
- python/csrc/flashinfer_ops_decode.cu+32
- python/csrc/flashinfer_ops_decode.h+59
- python/csrc/flashinfer_ops_prefill.cu+47
- python/csrc/flashinfer_ops_prefill.h+96
- python/csrc/norm.cu+46-16
- python/csrc/pytorch_extension_utils.h+4
- python/csrc/sampling.cu+176-35
- python/csrc/single_decode.cu+1-1
- python/csrc/single_prefill.cu+15-3
- python/flashinfer/__init__.py+28-23
- python/flashinfer/activation.py+102
- python/flashinfer/cascade.py+280-21
- python/flashinfer/decode.py+41-24
- python/flashinfer/group_gemm.py+50-1
- python/flashinfer/norm.py+27-6
- python/flashinfer/page.py+3-3
- python/flashinfer/prefill.py+97-46
- python/flashinfer/quantization.py+22
- python/flashinfer/sampling.py+489-41
- python/flashinfer/sparse.py+37-10
- python/generate_batch_paged_prefill_inst.py+8-6
- python/generate_batch_ragged_prefill_inst.py+8-6
- python/generate_single_prefill_inst.py+7-5
- python/setup.py+75-46
- python/tests/test_activation.py+45
- python/tests/test_fp8_prefill.py+208
- python/tests/test_norm.py+34-3
- python/tests/test_sampling.py+179-26
- python/tests/test_shared_prefix_kernels.py+40-51
- src/bench_batch_decode.cu+24-14
- src/bench_batch_prefill.cu+11-6
- src/bench_cascade.cu+36-20
- src/bench_sampling.cu+4-4
- src/bench_single_prefill.cu+77-1
- src/flashinfer_ops.cuh+15-13
- src/test_batch_decode.cu+6-3
- src/test_batch_prefill.cu+220-122
- src/test_cascade.cu+28-22
- src/test_fast_dequant.cu+71
- src/test_sampling.cu+2-2
- src/test_single_prefill.cu+108-15
- src/tvm_wrapper.cu+47-32
- version.txt+1-1
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
57 | 57 | | |
58 | 58 | | |
59 | 59 | | |
60 | | - | |
61 | | - | |
| 60 | + | |
| 61 | + | |
| 62 | + | |
| 63 | + | |
62 | 64 | | |
63 | 65 | | |
64 | 66 | | |
| |||
915 | 917 | | |
916 | 918 | | |
917 | 919 | | |
918 | | - | |
| 920 | + | |
| 921 | + | |
919 | 922 | | |
920 | 923 | | |
921 | 924 | | |
| |||
1089 | 1092 | | |
1090 | 1093 | | |
1091 | 1094 | | |
1092 | | - | |
1093 | | - | |
| 1095 | + | |
| 1096 | + | |
1094 | 1097 | | |
1095 | 1098 | | |
1096 | 1099 | | |
| |||
1103 | 1106 | | |
1104 | 1107 | | |
1105 | 1108 | | |
1106 | | - | |
1107 | | - | |
| 1109 | + | |
| 1110 | + | |
| 1111 | + | |
| 1112 | + | |
1108 | 1113 | | |
1109 | 1114 | | |
1110 | 1115 | | |
| |||
2324 | 2329 | | |
2325 | 2330 | | |
2326 | 2331 | | |
2327 | | - | |
| 2332 | + | |
| 2333 | + | |
2328 | 2334 | | |
2329 | 2335 | | |
2330 | 2336 | | |
| |||
2336 | 2342 | | |
2337 | 2343 | | |
2338 | 2344 | | |
2339 | | - | |
| 2345 | + | |
| 2346 | + | |
2340 | 2347 | | |
2341 | 2348 | | |
2342 | 2349 | | |
2343 | 2350 | | |
2344 | 2351 | | |
2345 | | - | |
2346 | | - | |
| 2352 | + | |
| 2353 | + | |
2347 | 2354 | | |
2348 | 2355 | | |
2349 | 2356 | | |
| |||
Lines changed: 1 addition & 1 deletion
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
324 | 324 | | |
325 | 325 | | |
326 | 326 | | |
327 | | - | |
| 327 | + | |
328 | 328 | | |
329 | 329 | | |
330 | 330 | | |
| |||
Lines changed: 1 addition & 1 deletion
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
111 | 111 | | |
112 | 112 | | |
113 | 113 | | |
114 | | - | |
| 114 | + | |
115 | 115 | | |
116 | 116 | | |
117 | 117 | | |
| |||
0 commit comments