Skip to content

Integrated fused RMSNorm with static quantization for post attention

186572a
Select commit
Loading
Failed to load commit list.
Merged

Add Fused RMSNorm + FP8 Per-tensor Static Quantization to Llama 3 Models #789

Integrated fused RMSNorm with static quantization for post attention
186572a
Select commit
Loading
Failed to load commit list.