test: Add grpo-qwen3-30ba3b-4n8g-40k config to performance test suite. by sfawzy-nv · Pull Request #1623 · NVIDIA-NeMo/RL

sfawzy-nv · 2025-12-11T00:05:11Z

What does this PR do?

Add grpo-qwen3-30ba3b-4n8g-128k config to performance test suite.

Issues

List issues that this PR closes (syntax):

Usage

You can potentially add a usage example below

# Add a code snippet demonstrating how to use this

Before your PR is "Ready for review"

Pre checks:

Make sure you read and followed Contributor guidelines
Did you write any new necessary tests?
Did you run the unit tests and functional tests locally? Visit our Testing Guide for how to run tests
Did you add or update any necessary documentation? Visit our Document Development Guide for how to write, build and test the docs.

Additional Information

...

Summary by CodeRabbit

New Features
- Added GRPO experiment configuration for performance testing with Qwen3-30B model, featuring Megatron-like parallelism and comprehensive logging.
- Introduced new performance test suite with TensorBoard metrics conversion and automated loss validation.


coderabbitai · 2025-12-11T00:11:24Z

📝 Walkthrough

Introduces a new YAML configuration file for GRPO performance testing with the Qwen3-30B-A3B model, a corresponding shell test script that runs the performance experiment, and a test manifest entry that registers the script.

Changes

| Cohort / File(s) | Summary |
| --- | --- |
| GRPO Performance Configuration `examples/configs/recipes/llm/performance/grpo-qwen3-30ba3b-4n8g-128K.yaml` | New YAML configuration file defining GRPO experiment parameters including Megatron-like parallelism settings (tensor/model parallelism), vLLM generation config, logging (WandB/TensorBoard), cluster GPU allocation, and model-specific training parameters. |
| GRPO Performance Test `tests/test_suites/llm/performance/grpo-qwen3-30ba3b-4n8g-128K.sh` | New shell script for performance testing that defines experiment parameters (num nodes, steps, runs), executes the GRPO experiment via `uv run`, converts TensorBoard logs to JSON, and conditionally runs metrics checks. |
| Test Manifest `tests/test_suites/performance.txt` | Single-line addition registering the new performance test script path. |
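
The three files in the table are linked only by naming convention, so a small consistency check may be useful when reviewing. The following is a minimal sketch, not part of this PR: `check_recipe` is a hypothetical helper, and it assumes the manifest holds one script path per line and that a recipe's config and script share the same base name.

```python
from pathlib import PurePosixPath


def check_recipe(config_path, script_path, manifest_lines):
    """Return a list of problems; an empty list means the files look consistent.

    Hypothetical helper. Assumes the manifest lists one script path per line.
    """
    problems = []
    config = PurePosixPath(config_path)
    script = PurePosixPath(script_path)

    # The YAML recipe and its test script are expected to share a base name.
    if config.stem != script.stem:
        problems.append(f"name mismatch: {config.stem} vs {script.stem}")

    # The script must appear in the manifest to be picked up by the suite.
    registered = {line.strip() for line in manifest_lines if line.strip()}
    if script_path not in registered:
        problems.append(f"not registered in manifest: {script_path}")

    return problems


if __name__ == "__main__":
    cfg = "examples/configs/recipes/llm/performance/grpo-qwen3-30ba3b-4n8g-128K.yaml"
    sh = "tests/test_suites/llm/performance/grpo-qwen3-30ba3b-4n8g-128K.sh"
    print(check_recipe(cfg, sh, [sh]))
```

A check along these lines would flag a config whose name drops a character (for example `30ba3` instead of `30ba3b`) before the test ever runs.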

Estimated code review effort

🎯 2 (Simple) | ⏱️ ~10 minutes

New configuration file: verify schema correctness and parameter alignment with existing GRPO configs
New test script: confirm proper environment variable usage, step calculation logic, and conditional metrics evaluation
No complex logic or structural modifications; primarily configuration and test orchestration
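
For reviewers unfamiliar with the two pieces of logic named above, the following sketch shows one plausible shape for them. It is an illustration under stated assumptions, not the script's actual code: the function names, the `train/loss` key, and the JSON layout (metric name mapped to a dict of step string to value) are all hypothetical.

```python
def num_runs(max_steps, steps_per_run):
    """Number of job submissions needed to cover max_steps (ceiling division)."""
    if steps_per_run <= 0:
        raise ValueError("steps_per_run must be positive")
    return (max_steps + steps_per_run - 1) // steps_per_run


def should_check_metrics(metrics, key, max_steps):
    """Gate metric checks on the run having actually reached max_steps.

    Assumes `metrics` looks like {"train/loss": {"1": 0.7, "2": 0.6, ...}},
    which is a guess at the TensorBoard-to-JSON dump format.
    """
    steps = metrics.get(key, {})
    if not steps:
        return False
    return max(int(step) for step in steps) >= max_steps


if __name__ == "__main__":
    dumped = {"train/loss": {"1": 0.7, "10": 0.4}}
    print(num_runs(10, 4))
    print(should_check_metrics(dumped, "train/loss", 10))
```

Gating the checks this way means an interrupted or partial run skips validation instead of failing on missing steps.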

Possibly related PRs

feat: Onboard perf recipes in tests #1322: Adds identical GRPO performance recipe and test script for the same grpo-qwen3-30ba3b-4n8g configuration setup.
cp: feat: Onboard perf recipes in tests (1322) into r0.4.0 #1497: Introduces related GRPO LLM performance recipes and test scripts in the same configuration and test directories.
perf: [Perf script] QWEN3 30B-A3B tensor_parallel_size from 4 to 2 #1558: Modifies tensor_parallel_size setting in similar Qwen3-30B-A3B GRPO configurations.

Suggested labels

Performance, Run CICD

Suggested reviewers

guyueh1
terrykong

🚥 Pre-merge checks | ✅ 2 | ❌ 2

❌ Failed checks (2 warnings)

| Check name | Status | Explanation | Resolution |
| --- | --- | --- | --- |
| Test Results For Major Changes | ⚠️ Warning | PR adds new GRPO performance test configuration for Qwen3-30B-A3B-128K without documenting test results, baselines, or convergence validation. | Add test results demonstrating successful execution, baseline performance metrics, convergence validation, and comparison with existing configuration to confirm no regressions. |
| Title check | ⚠️ Warning | The PR title mentions '40k' but the actual changes reference '128K', creating a discrepancy with the file contents. | Update the PR title to 'test: Add grpo-qwen3-30ba3b-4n8g-128K config to performance test suite.' to accurately reflect the actual configuration being added. |
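
The title warning comes down to comparing the recipe name in the PR title with the names of the changed files. A rough sketch of such a comparison follows; it is not how CodeRabbit implements the check, and both the regular expression and the helper names are assumptions made for illustration.

```python
import re

# Hypothetical pattern: "grpo-" followed by hyphen-separated alphanumeric parts.
NAME_RE = re.compile(r"grpo-[A-Za-z0-9]+(?:-[A-Za-z0-9]+)*")


def recipe_names(text):
    """Extract recipe-like names from text, lowercased for comparison."""
    return {match.group(0).lower() for match in NAME_RE.finditer(text)}


def title_matches_files(title, paths):
    """True if every recipe name in the title also appears in some changed path."""
    title_names = recipe_names(title)
    file_names = set()
    for path in paths:
        file_names |= recipe_names(path)
    return bool(title_names) and title_names <= file_names


if __name__ == "__main__":
    paths = [
        "examples/configs/recipes/llm/performance/grpo-qwen3-30ba3b-4n8g-128K.yaml",
        "tests/test_suites/llm/performance/grpo-qwen3-30ba3b-4n8g-128K.sh",
        "tests/test_suites/performance.txt",
    ]
    title = "test: Add grpo-qwen3-30ba3b-4n8g-40k config to performance test suite."
    print(title_matches_files(title, paths))
```

The comparison is case-insensitive, so `128k` in a title would still match `128K` in a file name; only a different sequence length such as `40k` is reported.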

✅ Passed checks (2 passed)

| Check name | Status | Explanation |
| --- | --- | --- |
| Description Check | ✅ Passed | Check skipped - CodeRabbit’s high-level summary is enabled. |
| Docstring Coverage | ✅ Passed | No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check. |


coderabbitai

Actionable comments posted: 1

📜 Review details

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 5bc5eba and dbe1d97.

📒 Files selected for processing (3)

examples/configs/recipes/llm/performance/grpo-qwen3-30ba3b-4n8g-128K.yaml (1 hunks)
tests/test_suites/llm/performance/grpo-qwen3-30ba3b-4n8g-128K.sh (1 hunks)
tests/test_suites/performance.txt (1 hunks)

🧰 Additional context used

📓 Path-based instructions (5)

examples/configs/recipes/**/*.yaml