Skip to content

[Kernel] Switch fp8 layers to use the CUTLASS kernels#5183

Merged
pcmoritz merged 9 commits intovllm-project:mainfrom
neuralmagic:tms/use_cutlass_4_fp8
Jun 7, 2024
Merged

[Kernel] Switch fp8 layers to use the CUTLASS kernels#5183
pcmoritz merged 9 commits intovllm-project:mainfrom
neuralmagic:tms/use_cutlass_4_fp8

Commits

Commits on Jun 1, 2024

Commits on Jun 3, 2024

Commits on Jun 6, 2024