Skip to content

fix: move mPtrRoutingReplayOut to end of routing structs#4

Merged
TomerBN-Nvidia merged 1 commit intoupstream-routing-replayfrom
fix-routing-struct-layout
Apr 14, 2026
Merged

fix: move mPtrRoutingReplayOut to end of routing structs#4
TomerBN-Nvidia merged 1 commit intoupstream-routing-replayfrom
fix-routing-struct-layout

Conversation

@TomerBN-Nvidia
Copy link
Copy Markdown
Owner

Move field from middle to end of DataBase and KernelParamsBase structs to preserve existing field offsets. Fixes FP8/FP4 autotuner GEMM crashes (11/15 failures). Also adds missing routing_replay_out arg to MXINT4/FP4 autotuner paths.

Inserting the field in the middle of DataBase and KernelParamsBase
shifted memory offsets for all subsequent fields, causing GEMM crashes
in FP8/FP4 autotuner tests (11/15 failures). Moving to end preserves
the original layout for all existing fields.

Also adds missing routing_replay_out arg to MXINT4 and FP4 paths
in MoERunner._run().

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
@TomerBN-Nvidia TomerBN-Nvidia merged commit db1855e into upstream-routing-replay Apr 14, 2026
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant