Skip to content

[Perf] Optimize cutlass fp8 scaled mm bypassing padding, 20% kernel performance improvement#43706

Merged
yewentao256 merged 7 commits into
mainfrom
wentao-optimize-cutlassfp8
Jun 1, 2026
Merged

[Perf] Optimize cutlass fp8 scaled mm bypassing padding, 20% kernel performance improvement#43706
yewentao256 merged 7 commits into
mainfrom
wentao-optimize-cutlassfp8

fix ci

83c0949
Select commit
Loading
Failed to load commit list.
GitHub Advanced Security / CodeQL succeeded May 29, 2026 in 2s

No new alerts in code changed by this pull request