Skip to content

[Perf] Optimize cutlass fp8 scaled mm bypassing padding, 20% kernel performance improvement#43706

Merged
yewentao256 merged 7 commits into
mainfrom
wentao-optimize-cutlassfp8
Jun 1, 2026
Merged

[Perf] Optimize cutlass fp8 scaled mm bypassing padding, 20% kernel performance improvement#43706
yewentao256 merged 7 commits into
mainfrom
wentao-optimize-cutlassfp8

fix ci

83c0949
Select commit
Loading
Failed to load commit list.
Meta CodeSync / Meta Internal-Only Changes Check succeeded May 29, 2026 in 0s

There is no internal Diff connected, this can be merged now