cpu:ppc64: fix GEMM reorder build issue on Power system#4002
cpu:ppc64: fix GEMM reorder build issue on Power system#4002vpirogov merged 3 commits intouxlfoundation:mainfrom
Conversation
0d66ac0 to
ab697b0
Compare
ab697b0 to
d3e1264
Compare
|
Could you, please, elaborate this bullet?
I don't see changes to the build system. How this compatibility was achieved? |
|
@Tiwari-Avanish Our conda build succeeds now. For future reference to test it you can run: Currently the build shows 2 warnings that need to be fixed: which is quite obvious, and which I assume is just a typo since in all other Please also check if the performance change is in line with your expectations on Power10 and there is no degradation in case of Power8/9. You do not need to run the entire perf test suite, since you tested that last time, just a smaller subset. |
|
Hi @dzarukin Thanks for reviewing this PR. By “compatibility with all supported Power architectures,” I meant source-level compatibility:
So “compatibility” means correct compilation and execution across Power7–Power10 using conditional compilation, not build-system modifications. Thanks @spalicki for reviewing this and checking the conda build system.
Previously, I ran these tests during performance validation, so I can rerun them if needed. My assumption is that Power8/9 performance should remain unchanged, but I’m happy to test if you’d like confirmation. |
2aea3d4 to
7c8e525
Compare
|
Hi @spalicki, I’ve fixed the warning and corrected the typo. Top 15 Benchmark improvements for perf_matmul_inference_lb (earlier vs current changes): Click to expand
Whenever you have a moment, please review this PR. If everything looks good, kindly help merge it. |
Yes, that is why I asked. Power 8/9 should remain unchanged with Power10 getting an uplift. |
Macro is always defined, must have a value of 1 instead.
7c8e525 to
aca5334
Compare
|
Cherry-picked to |
My previous changes #3156 got reverted back because of build issue on different power system.
This PR addresses the build issues for the GEMM reorder kernels on ppc64 architectures
including Power8, Power9, and Power10.
The changes include:
Test Results:
Power10:
Power 9:
Power8:
Related PRs:
Power8 compatibility fixes: PR #3968
Saved GEMM/reorder changes: PR #3974
Thanks to @vpirogov and @spalicki for helping identify and preserve the relevant changes.
Please review this PR. All changes have been tested locally across Power8, Power9, and Power10.