Fix blockwise FP8 cast main weight to model weight shard #7

Merged
kunlunl merged 2 commits into kunlunl:kunlunl/megatron-fsdp-fp8-params from shjwudp:megatron-fsdp-fp8-params-jianbinc-dec16_v2
Dec 17, 2025

Commits

Commits on Dec 16, 2025