[Fix] Fix FlashInfer CUTLASS MoE for unquantized models and single-GPU, bump FlashInfer to 0.6.8 #38215

Draft
askliar wants to merge 5 commits into vllm-project:main from
askliar:feature/add_nemotronh_spark_support

Commits

Commits on Mar 26, 2026