
Revert Qwen3 235B GB300 MXFP8 large scale mapping #2338

Merged
ko3n1g merged 1 commit into NVIDIA-NeMo:main from dingqingy-nv:qwen3_gb300_mxfp8_large_scale_revert on Feb 13, 2026

Conversation

@dingqingy-nv
Contributor

@dingqingy-nv dingqingy-nv commented Feb 11, 2026

What does this PR do ?

Revert the large-scale parallelism mapping for the Qwen3 235B GB300 MXFP8 config to resolve a performance regression.

Summary by CodeRabbit

  • Chores
    • Added a new large-scale configuration variant with optimized parallelism layout and enhanced CUDA graph scoping for performance workloads.

Signed-off-by: Dingqing Yang <dingqingy@nvidia.com>
@dingqingy-nv dingqingy-nv requested a review from ko3n1g February 11, 2026 21:48
@dingqingy-nv dingqingy-nv added the r0.3.0 Cherry-pick label for r0.3.0 release branch label Feb 11, 2026
@coderabbitai
Contributor

coderabbitai bot commented Feb 11, 2026

📝 Walkthrough

Added a new large-scale Qwen3 235B A22B FP8 MX configuration variant (QWEN3_235B_A22B_PRETRAIN_CONFIG_GB300_FP8_MX_LARGE_SCALE) with custom settings for virtual pipeline parallelism, expert model parallelism, CUDA graph scoping, and batch size.

Changes

Cohort: Qwen3 Large-Scale Config
File(s): scripts/performance/configs/qwen/qwen3_workload_base_configs.py
Summary: Added a new exported configuration constant extending QWEN3_235B_A22B_PRETRAIN_CONFIG_GB300_FP8_MX_V2 with virtual_pipeline_model_parallel_size=12, expert_model_parallel_size=16, cuda_graph_scope=["moe_router", "moe_preprocess"], and global_batch_size=512 for the large-scale variant.
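The overrides listed in the summary can be sketched as follows. This is a minimal illustration, not the actual code from qwen3_workload_base_configs.py: the WorkloadConfig dataclass and its defaults are hypothetical stand-ins, and only the four override values come from the PR summary.

```python
from dataclasses import dataclass, field, replace
from typing import List

# Hypothetical stand-in for the workload config type; field names match those
# listed in the PR summary, but the real class and its defaults may differ.
@dataclass
class WorkloadConfig:
    virtual_pipeline_model_parallel_size: int = 4
    expert_model_parallel_size: int = 8
    cuda_graph_scope: List[str] = field(default_factory=list)
    global_batch_size: int = 256

# Stand-in for the existing V2 base config referenced in the summary.
QWEN3_235B_A22B_PRETRAIN_CONFIG_GB300_FP8_MX_V2 = WorkloadConfig()

# The large-scale variant extends the V2 base with the overrides from the diff.
QWEN3_235B_A22B_PRETRAIN_CONFIG_GB300_FP8_MX_LARGE_SCALE = replace(
    QWEN3_235B_A22B_PRETRAIN_CONFIG_GB300_FP8_MX_V2,
    virtual_pipeline_model_parallel_size=12,
    expert_model_parallel_size=16,
    cuda_graph_scope=["moe_router", "moe_preprocess"],
    global_batch_size=512,
)
```

Deriving the variant via dataclasses.replace keeps the base config immutable, which is why reverting the mapping is a matter of dropping (or restoring) the overridden fields rather than editing the V2 constant itself.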

Estimated code review effort

🎯 1 (Trivial) | ⏱️ ~5 minutes

Possibly related PRs

Suggested labels

performance

Suggested reviewers

  • ko3n1g
  • erhoo82
🚥 Pre-merge checks | ✅ 3 | ❌ 1

❌ Failed checks (1 warning)
  • Test Results For Major Changes ⚠️ Warning: The PR addresses a performance regression but lacks the before-and-after metrics, configuration context, and validation data required by the custom check guidelines. Resolution: add before-and-after performance numbers, hardware configuration details, and benchmarking data validating that the revert resolves the regression.

✅ Passed checks (3 passed)
  • Description Check ✅ Passed: Check skipped because CodeRabbit's high-level summary is enabled.
  • Title Check ✅ Passed: The title 'Revert Qwen3 235B GB300 MXFP8 large scale mapping' accurately describes the main change: adding a reverted configuration for the large-scale Qwen3 235B FP8 MX variant due to a performance regression.
  • Docstring Coverage ✅ Passed: No functions found in the changed files; docstring coverage check skipped.



No actionable comments were generated in the recent review. 🎉




@ko3n1g ko3n1g merged commit d3d8030 into NVIDIA-NeMo:main Feb 13, 2026
50 of 53 checks passed
ko3n1g pushed a commit that referenced this pull request Feb 13, 2026
Signed-off-by: Dingqing Yang <dingqingy@nvidia.com>
Signed-off-by: NeMo Bot <nemo-bot@nvidia.com>
ko3n1g added a commit that referenced this pull request Feb 20, 2026
pengdurice pushed a commit to pengdurice/Megatron-Bridge that referenced this pull request Feb 24, 2026
copy-pr-bot bot pushed a commit that referenced this pull request Mar 19, 2026
Signed-off-by: Dingqing Yang <dingqingy@nvidia.com>

Labels

r0.3.0 Cherry-pick label for r0.3.0 release branch


3 participants