Skip to content

[Perf] Optimize cutlass fp8 scaled mm bypassing padding, 20% kernel performance improvement#43706

Merged
yewentao256 merged 7 commits into
mainfrom
wentao-optimize-cutlassfp8
Jun 1, 2026
Merged

[Perf] Optimize cutlass fp8 scaled mm bypassing padding, 20% kernel performance improvement#43706
yewentao256 merged 7 commits into
mainfrom
wentao-optimize-cutlassfp8

Commits

Commits on May 26, 2026

Commits on May 27, 2026

Commits on May 28, 2026

Commits on May 29, 2026