Skip to content

[Kernel] Optimize FP8 support for MoE kernel / Mixtral via static scales#4343

Merged
robertgshaw2-redhat merged 33 commits intovllm-project:mainfrom
pcmoritz:mixtral-fp8-static
Apr 27, 2024
Merged

[Kernel] Optimize FP8 support for MoE kernel / Mixtral via static scales#4343
robertgshaw2-redhat merged 33 commits intovllm-project:mainfrom
pcmoritz:mixtral-fp8-static

Commits

Commits on Apr 24, 2024

Commits on Apr 25, 2024

Commits on Apr 26, 2024