[μKernels]: lowering based on m and n tile size #1068

arun-thmn · 2025-07-08T07:04:25Z

This PR update a small logic in the micro-kernels lowering based on m and n tile size. If:

m >= n - first load all B matrix elements, then broadcast A one-by-one + do fma.
n > m - do the opposite. First broadcast all A matrix elements, then load B one-by-one + do fma

The logic is updated for fp32 (both avx512 & avx2) and bf16 (only avx512).
Bf16 avx2 will be done later after fixing the llvm pattern matching problem on ADL machine.

adam-smnk

Judging by the tests looks fine.

To be honest, I'm getting lost in all the branches here 😅
Perhaps it could be simplified if you created all the needed ops first then reshuffled them using rewrite.moveOp....
But ultimately as you prefer, as long as you know what's going on. 🙂

lib/TPP/Transforms/VectorContractToMicroKernels.cpp

arun-thmn · 2025-07-08T12:50:56Z

Judging by the tests looks fine.

To be honest, I'm getting lost in all the branches here 😅 Perhaps it could be simplified if you created all the needed ops first then reshuffled them using rewrite.moveOp.... But ultimately as you prefer, as long as you know what's going on. 🙂

True, @adam-smnk.
This pass requires lot of conditional branches. There are few TODOs for this pass like i8 support and want to finish then first. Afterwards, definitely will try to simply this one.

Lowering based on m and n tile size

1afdb43

arun-thmn added the benchmark-full Benchmark all targets label Jul 8, 2025

clang-format fix

f4828e9

arun-thmn marked this pull request as ready for review July 8, 2025 07:34

arun-thmn requested review from adam-smnk and shahidact July 8, 2025 07:35

adam-smnk approved these changes Jul 8, 2025

View reviewed changes

adam-smnk reviewed Jul 8, 2025

View reviewed changes

lib/TPP/Transforms/VectorContractToMicroKernels.cpp Show resolved Hide resolved

added an explanation

f9dabe6

arun-thmn merged commit 4a95804 into libxsmm:main Jul 8, 2025
14 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[μKernels]: lowering based on m and n tile size #1068

[μKernels]: lowering based on m and n tile size #1068

Uh oh!

arun-thmn commented Jul 8, 2025

Uh oh!

adam-smnk left a comment

Uh oh!

Uh oh!

arun-thmn commented Jul 8, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[μKernels]: lowering based on m and n tile size #1068

[μKernels]: lowering based on m and n tile size #1068

Uh oh!

Conversation

arun-thmn commented Jul 8, 2025

Uh oh!

adam-smnk left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

arun-thmn commented Jul 8, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants