[V4] Apply swiglu_limit clamp to DeepseekV2MLP shared/dense MLP

beceaf0
Merged

[DeepSeek V4] Fix meaningless numbers in chat output by adding swiglu_limit clamp to DeepseekV2MLP #23776

Annotations

1 warning
label: succeeded Apr 26, 2026 in 4s