fix: perf configs 2 #2393

Merged
ko3n1g merged 5 commits into main from ko3n1g/fix/perf-configs-2
Feb 17, 2026

Conversation

@ko3n1g (Contributor) commented Feb 16, 2026

What does this PR do ?

Add a one line overview of what this PR aims to accomplish.

Changelog

  • Add specific line by line info of high level changes in this PR.

GitHub Actions CI

See the CI section in the Contributing doc for how to trigger the CI. An NVIDIA developer will need to approve and trigger the CI for external contributors.

Before your PR is "Ready for review"

Pre checks:

  • Make sure you read and followed Contributor guidelines
  • Did you write any new necessary tests?
  • Did you add or update any necessary documentation?
  • Does the PR affect components that are optional to install? (e.g. Numba, Pynini, Apex)
    • Reviewer: Does the PR have correct import guards for all optional libraries?

If you haven't finished some of the above items, you can still open a "Draft" PR.

Additional Information

  • Related to # (issue)

Summary by CodeRabbit

  • Chores
    • Updated pretraining configuration settings to enable token dispatch improvements across multiple model variants.

Signed-off-by: oliver könig <okoenig@nvidia.com>
@coderabbitai coderabbitai bot commented Feb 16, 2026

📝 Walkthrough

This pull request adds token-level MOE dispatcher configuration to multiple Qwen3 pretraining model configurations. Specifically, cfg.model.moe_token_dispatcher_type is set to "flex" across several configuration functions (gb200, gb300, b300, b200, h100, and variants), complementing existing MOE dispatcher backend settings.

Changes

Cohort: Qwen3 MOE Dispatcher Configuration
File(s): scripts/performance/configs/qwen/qwen3_llm_pretrain.py
Summary: Adds token-level MOE dispatcher type assignment ("flex") to multiple Qwen3 pretraining configuration functions alongside existing dispatcher backend settings.
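As described above, each affected config function pairs the existing flex-dispatcher backend setting with an explicit dispatcher type. A minimal, self-contained sketch of the pattern follows; the function name, the `SimpleNamespace` stand-in for the config object, and the backend value "deepep" are illustrative assumptions, not the actual contents of qwen3_llm_pretrain.py:

```python
from types import SimpleNamespace


def qwen3_235b_a22b_pretrain_config_gb200():
    # Hypothetical stand-in for the real config object used in
    # scripts/performance/configs/qwen/qwen3_llm_pretrain.py.
    cfg = SimpleNamespace(model=SimpleNamespace())

    # Existing setting: which backend the flex dispatcher uses
    # ("deepep" is an assumed placeholder value).
    cfg.model.moe_flex_dispatcher_backend = "deepep"

    # Change in this PR: make the dispatcher type itself explicit
    # instead of relying on a framework default.
    cfg.model.moe_token_dispatcher_type = "flex"
    return cfg


cfg = qwen3_235b_a22b_pretrain_config_gb200()
print(cfg.model.moe_token_dispatcher_type)  # flex
```

Setting both fields explicitly keeps each hardware variant's config self-describing, which is what the review comment below on the gb300 variant is about.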

Estimated code review effort

🎯 2 (Simple) | ⏱️ ~8 minutes

Possibly related PRs

Suggested labels

performance

Suggested reviewers

  • yaoyu-33
  • erhoo82
🚥 Pre-merge checks | ✅ 3 | ❌ 2

❌ Failed checks (1 warning, 1 inconclusive)

Test Results For Major Changes — ⚠️ Warning
  Explanation: The PR adds MOE token dispatcher type configuration changes affecting performance-critical behavior, but the PR description lacks test results, performance metrics, or regression analysis.
  Resolution: Update the PR description with before-and-after performance metrics and convergence data for the affected Qwen3 configurations to validate no regressions.

Title check — ❓ Inconclusive
  Explanation: The title 'fix: perf configs 2' is vague and generic, lacking specificity about which performance configurations are being fixed.
  Resolution: Consider a more descriptive title such as 'fix: enable flex token dispatcher for Qwen3 pretrain configs' to clearly communicate the primary change.

✅ Passed checks (3 passed)

Description Check — ✅ Passed: Check skipped because CodeRabbit's high-level summary is enabled.
Docstring Coverage — ✅ Passed: Docstring coverage is 100.00%, which meets the required threshold of 80.00%.
Merge Conflict Detection — ✅ Passed: No merge conflicts detected when merging into main.


@coderabbitai coderabbitai bot left a comment

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)
scripts/performance/configs/qwen/qwen3_llm_pretrain.py (1)

78-84: ⚠️ Potential issue | 🟠 Major

moe_token_dispatcher_type is not set in the gb300 config for qwen3_235b, unlike all other sibling configs.

Every other qwen3_235b_a22b_pretrain_config_* and qwen3_30b_a3b_pretrain_config_* function explicitly sets cfg.model.moe_token_dispatcher_type (to either "flex" or "alltoall"). This function sets moe_flex_dispatcher_backend but omits the dispatcher type, so it will silently use whatever default the framework provides.

If this is intentional, a comment explaining the omission would help. Otherwise, it likely needs "flex" to match the gb200 variant:

Proposed fix
     cfg.model.moe_flex_dispatcher_backend = base_cfg.moe_flex_dispatcher_backend
+    cfg.model.moe_token_dispatcher_type = "flex"
 
     set_qwen3_common_configs(cfg)

As per coding guidelines: "Do not add arbitrary defaults for configs, be as explicit as possible."
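The guideline the reviewer cites could be enforced mechanically with a small check that flags any config function setting a flex backend without also setting the dispatcher type. This is an illustrative sketch, not part of the PR; the helper name and the two sample config functions (mirroring the gb200/gb300 situation above) are hypothetical:

```python
from types import SimpleNamespace


def missing_explicit_dispatcher_type(config_fn):
    # Apply the config function to a fresh namespace, then check whether
    # it set a flex backend without an explicit dispatcher type.
    cfg = SimpleNamespace(model=SimpleNamespace())
    config_fn(cfg)
    model_fields = vars(cfg.model)
    return ("moe_flex_dispatcher_backend" in model_fields
            and "moe_token_dispatcher_type" not in model_fields)


# Hypothetical samples: one compliant config, one with the omission
# flagged in the review comment above.
def config_gb200(cfg):
    cfg.model.moe_flex_dispatcher_backend = "deepep"  # assumed value
    cfg.model.moe_token_dispatcher_type = "flex"


def config_gb300(cfg):
    cfg.model.moe_flex_dispatcher_backend = "deepep"  # type omitted!


print(missing_explicit_dispatcher_type(config_gb200))  # False
print(missing_explicit_dispatcher_type(config_gb300))  # True
```

A check like this could run in CI over all `*_pretrain_config_*` functions so silent reliance on framework defaults is caught at review time rather than in a performance regression.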


Labels: none yet
Projects: none yet
2 participants