-
Notifications
You must be signed in to change notification settings - Fork 3.7k
KleidiAI SGEMM/IGEMM/Quantized MatMul - Modular MLAS API Changes for KleidiAI #25187
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
hariharans29
merged 53 commits into
microsoft:main
from
damdoo01-arm:kai_sgemm_igemm_quant_gemv
Jul 25, 2025
Merged
Changes from all commits
Commits
Show all changes
53 commits
Select commit
Hold shift + click to select a range
b8d540d
CLNTFRAME-376: Add initial pipeline setup
JonathanC-ARM d69d3f5
Integrate initial KFI changes
JonathanC-ARM 9e56664
updated build and test to have mac stages
JonathanC-ARM 5c03bcd
Sync with latest from old repo
dflavin-arm 48b09e3
Added Dynamic-Quantized Matmuls and GEMV
damdoo01-arm 972eef5
Fixed copyright attribution
damdoo01-arm baa63df
KFI-51 Requires target "kleidiai" error building ONNX RT on aarch64.
Colm-in-Arm a4068c1
MLAS API updates, mlas test fixes and ORT test fixes
damdoo01-arm 3b34766
Remove Arm CI internal directory inadvertently pushed previously
damdoo01-arm 44199a5
Fix to iOS build
damdoo01-arm 459acf8
2nd attempt to fix ios build by force disabling KAI
damdoo01-arm c675ccd
Wrap preprocessor ifdefs around dedicated KAI lib
damdoo01-arm 8b8e6a0
Lint fixes
damdoo01-arm 91008e9
Added Android/Linux CI build fixes plus fixed a layer parser fix
damdoo01-arm f984e81
Fix to 2 more CI failures. 1. kleidiai dir not visible in some builds…
damdoo01-arm 25e9815
Remove badly named directory
damdoo01-arm 3b9fd9a
Renamed kleidiai dir in lower case
damdoo01-arm fa558b3
Merge branch 'main' into kai_sgemm_igemm_quant_gemv
damdoo01-arm db144a0
QGemm call fixes that resolve the failing tests in CPU_U8S8_Precision…
38083e6
RESTRICT define created to overcome syntactical differences between c…
damdoo01-arm 8285fcf
Resolve conflicts in mlasi.h
damdoo01-arm 967e337
Added support to restrict keyword for msvc in KAI convolve function
damdoo01-arm ff5c2ea
Attempt to fix MSVC build by correctly linking kleidiai static library
damdoo01-arm 8e6f4ea
Syntax fix on prev
damdoo01-arm d38d63d
Reverting back to where we were before MSVC work
damdoo01-arm dead330
Staged for MSVC changes (now Linux/MacOS compatible)
damdoo01-arm 13a1c2b
Workaround pending KleidiAI release
damdoo01-arm 8471aa4
Merge branch 'microsoft:main' into kai_sgemm_igemm_quant_gemv
damdoo01-arm 6da3217
Merge branch 'microsoft:main' into kai_sgemm_igemm_quant_gemv
damdoo01-arm 34ec660
Reverted to KleidiAI 1.9 removing SME support for MSVC. Fixed duktape…
damdoo01-arm be0b6bf
KleidiAI build in Android
damdoo01-arm b82f64b
Checkpointing lint fixes after android enablement and addressing comm…
damdoo01-arm 810ee40
Added MSVC 1944 flag and fixed syntax issue on quantize matmul c++ ca…
damdoo01-arm 6ef54d9
Reverted suggestion for 1944 gate, not possible without KAI release
damdoo01-arm 1169ccc
Most of the responses to comments (small number outstanding)
damdoo01-arm db2feff
Unit test updates and lint fixes (more to follow)
damdoo01-arm 1aa31f9
DynQuantMatMul enhancements
damdoo01-arm 62175e8
Corrected typo
damdoo01-arm 8e277e3
Added further comments
damdoo01-arm 72dd877
Lint fix
damdoo01-arm 795ed28
Temporary commit to check changes
damdoo01-arm baa2eb1
Refactor overrides for kleidi func calls and included fix for matmul …
damdoo01-arm 9c7360e
Merge branch 'microsoft:main' into kai_sgemm_igemm_quant_gemv
damdoo01-arm 00fda4a
Reinstated SME checks where still required
damdoo01-arm 4379912
Added Dynamic Quantization Unit Test
damdoo01-arm 74e8bf1
Fixed lint issues
damdoo01-arm e9c7291
Further updates to comments, dyn quant matmul changes only pending
damdoo01-arm b201e5e
Dynamic Quant Matmul changes based on feed back
damdoo01-arm dba3f0b
Tidy-up based on MS feedback
damdoo01-arm 52961ea
Lint fixes
damdoo01-arm 8523708
Added Jonathan's guard conditions for DynQuantMatMul
damdoo01-arm d45c6bb
Fixed unused variable error after guard include
damdoo01-arm 4753512
Removed global variable and fixed transA override (removed inadverten…
damdoo01-arm File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.