-
Notifications
You must be signed in to change notification settings - Fork 19.7k
sycl: Battlemage AOT build via spir64_gen + MMQ subgroup annotations #22147
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Changes from 2 commits
4065b65
dbbf560
21ed9b3
d1313cf
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
| Original file line number | Diff line number | Diff line change |
|---|---|---|
|
|
@@ -2178,6 +2178,8 @@ inline void ggml_sycl_op_mul_mat_sycl( | |
| #endif | ||
| if ((src0->type == GGML_TYPE_F16 || ggml_is_quantized(src0->type)) && use_fp16 && ggml_is_contiguous(src0) && | ||
| row_diff == src0->ne[1] && dst->op_params[0] == GGML_PREC_DEFAULT) { | ||
| // NOTE: Fused dequant+GEMM and MMQ/DPAS were both attempted (Steps 10-11 | ||
|
Contributor
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. remove these lines.
Contributor
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Removed |
||
| // in optimization-workbook.md) but are slower than dequant+oneDNN. | ||
| ggml_sycl_pool_alloc<sycl::half> src0_as_f16(ctx.pool()); | ||
| if (src0->type != GGML_TYPE_F16) { | ||
| scope_op_debug_print scope_dbg_print(__func__, "/to_fp16_sycl", dst, /*num_src=*/2, | ||
|
|
@@ -3261,9 +3263,12 @@ enum class mul_mat_algo { | |
| }; | ||
|
|
||
| inline bool ggml_sycl_supports_mmq(enum ggml_type type) { | ||
| // TODO: accuracy issues in MMQ | ||
| GGML_UNUSED(type); | ||
| return false; | ||
| // DPAS INT8 MMQ kernel exists in mmq.cpp but is slower than dequant+oneDNN. | ||
| // Disabled pending further optimization. See optimization-workbook.md Step 11. | ||
|
Contributor
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. This code doesn't change any behavior.
Contributor
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Removed |
||
| switch (type) { | ||
| default: | ||
| return false; | ||
| } | ||
| } | ||
|
|
||
| inline bool ggml_sycl_supports_reorder_mul_mat_sycl(enum ggml_type type) { | ||
|
|
||
Uh oh!
There was an error while loading. Please reload this page.