Revert "Revert Qwen3 235B GB300 MXFP8 large scale mapping (#2338)"#2457

Merged
ko3n1g merged 1 commit into main from
ko3n1g/revert/d3d8030fac4ea69cc228151353c9288acbf2fe6f
Feb 20, 2026

Conversation


ko3n1g (Contributor) commented Feb 20, 2026

This reverts commit d3d8030.

What does this PR do ?

Reverts the revert commit d3d8030, re-applying the Qwen3 235B GB300 MXFP8 large-scale mapping change from #2338.

Changelog

  • Removes the virtual_pipeline_model_parallel_size, expert_model_parallel_size, and cuda_graph_scope overrides from QWEN3_235B_A22B_PRETRAIN_CONFIG_GB300_FP8_MX_LARGE_SCALE, leaving only the global_batch_size=512 override.

GitHub Actions CI

See the CI section in the Contributing doc for how to trigger the CI. An NVIDIA developer will need to approve and trigger the CI for external contributors.

Before your PR is "Ready for review"

Pre checks:

  • Make sure you have read and followed the Contributor guidelines
  • Did you write any new necessary tests?
  • Did you add or update any necessary documentation?
  • Does the PR affect components that are optional to install? (e.g., Numba, Pynini, Apex)
    • Reviewer: Does the PR have correct import guards for all optional libraries?

If you haven't finished some of the above items, you can still open a "Draft" PR.

Additional Information

  • Related to # (issue)

Summary by CodeRabbit

  • Chores
    • Streamlined large-scale workload configuration by removing explicit parallelism and CUDA graph scope parameters, simplifying performance tuning settings.


coderabbitai bot commented Feb 20, 2026

No actionable comments were generated in the recent review. 🎉


📝 Walkthrough


This change removes configuration parameters from the QWEN3 large-scale pretrain variant, dropping the explicit settings for virtual pipeline model parallelism, expert model parallelism, and CUDA graph scoping while retaining the global batch size override.

Changes

QWEN3 Configuration Update (scripts/performance/configs/qwen/qwen3_workload_base_configs.py):
Removed three parameters (virtual_pipeline_model_parallel_size=12, expert_model_parallel_size=16, cuda_graph_scope=["moe_router", "moe_preprocess"]) from the QWEN3_235B_A22B_PRETRAIN_CONFIG_GB300_FP8_MX_LARGE_SCALE replace() call, leaving only the global_batch_size=512 override.
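The override pattern described above can be sketched with a plain dataclass. This is a hypothetical illustration only: the WorkloadConfig class, its field names, and its defaults are assumptions modeled on the parameters listed in the walkthrough, not the actual definitions in qwen3_workload_base_configs.py.

```python
from dataclasses import dataclass, field, replace
from typing import List, Optional

# Hypothetical sketch of a workload config; field names mirror the
# parameters named in the walkthrough, but the real dataclass may differ.
@dataclass(frozen=True)
class WorkloadConfig:
    global_batch_size: int = 256
    virtual_pipeline_model_parallel_size: Optional[int] = None
    expert_model_parallel_size: Optional[int] = None
    cuda_graph_scope: List[str] = field(default_factory=list)

BASE = WorkloadConfig()

# Before this PR, the large-scale GB300 MXFP8 variant overrode four fields:
before = replace(
    BASE,
    global_batch_size=512,
    virtual_pipeline_model_parallel_size=12,
    expert_model_parallel_size=16,
    cuda_graph_scope=["moe_router", "moe_preprocess"],
)

# After this PR, only the batch-size override remains; the other three
# fields fall back to the base config's defaults:
after = replace(BASE, global_batch_size=512)

print(after.global_batch_size)           # 512
print(after.expert_model_parallel_size)  # None
```

dataclasses.replace returns a new instance with only the named fields changed, which is why dropping the extra keyword arguments reverts those settings to whatever the base preset defines.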

Estimated code review effort

🎯 1 (Trivial) | ⏱️ ~3 minutes

Suggested labels

r0.3.0

Suggested reviewers

  • erhoo82
  • dingqingy-nv
🚥 Pre-merge checks | ✅ 3 | ❌ 1

❌ Failed checks (1 warning)

  • Test Results For Major Changes (⚠️ Warning): The PR removes critical parallelism and CUDA graph configuration parameters from a large-scale training preset without test results, performance metrics, or supporting evidence in the description. Resolution: update the PR description with test results, performance comparisons, and an explanation of why this revert is necessary and whether it prevents regressions.

✅ Passed checks (3 passed)

  • Description Check (✅ Passed): Check skipped; CodeRabbit's high-level summary is enabled.
  • Title Check (✅ Passed): The title accurately describes the changeset as a revert of a previous revert, specifically relating to the Qwen3 235B GB300 MXFP8 large-scale mapping configuration.
  • Docstring Coverage (✅ Passed): No functions found in the changed files; skipping the docstring coverage check.


@ko3n1g ko3n1g merged commit 8f4439c into main Feb 20, 2026
55 of 56 checks passed
@ko3n1g ko3n1g deleted the ko3n1g/revert/d3d8030fac4ea69cc228151353c9288acbf2fe6f branch February 20, 2026 18:35
pengdurice pushed a commit to pengdurice/Megatron-Bridge that referenced this pull request Feb 24, 2026