Skip to content

fix: Reduce memory usage in fused moe op associated with AutoTuning and fix moe fallback issue.#3793

Merged
litaotju merged 2 commits intoNVIDIA:release/0.19from
hyukn:fix/reduce_autotune_mem_usage_0.19
Apr 24, 2025
Merged

fix: Reduce memory usage in fused moe op associated with AutoTuning and fix moe fallback issue.#3793
litaotju merged 2 commits intoNVIDIA:release/0.19from
hyukn:fix/reduce_autotune_mem_usage_0.19

Commits

Commits on Apr 23, 2025