cp: feat: add dapo recipe and test (1617) into r0.5.0 #1687

Merged
terrykong merged 1 commit into r0.5.0 from cherry-pick-1617-r0.5.0 on Dec 22, 2025

Conversation


@chtruong814 chtruong814 commented Dec 22, 2025

beep boop [🤖]: Hi @ZhiyuLi-Nvidia 👋,

we've cherry-picked #1617 into r0.5.0 for you! 🚀

Please review and approve this cherry-pick at your convenience!

Summary by CodeRabbit

  • Tests

    • Added performance testing configuration and test suite for DeepSeek v3 671B model training, including environment setup, metric validation, and tensorboard log conversion.
  • Chores

    • Updated performance test manifest with new DeepSeek v3 test script.


Signed-off-by: Zhiyu Li <zhiyul@NVIDIA.com>
Signed-off-by: NeMo Bot <nemo-bot@nvidia.com>
@terrykong terrykong added the CI:L0 Run doctests and unit tests label Dec 22, 2025
@terrykong terrykong enabled auto-merge (squash) December 22, 2025 18:08

coderabbitai bot commented Dec 22, 2025

📝 Walkthrough

A new DAPO DeepSeek v3 performance configuration and test script are introduced for a 64-node, 8-GPU setup. The YAML defines training parameters, model settings, loss functions, and cluster topology. A corresponding Bash test script launches the experiment and validates training metrics. The test suite manifest is updated to register the new test.

Changes

  • Configuration — examples/configs/recipes/llm/performance/dapo-deepseek-v3-64n8g.yaml
    New YAML configuration file specifying DAPO defaults, loss function parameters, policy/Megatron model settings, checkpointing, data limits, environment worker configuration, logging (GPU monitoring, WandB, MLflow), and cluster topology for the 64-node, 8-GPU DeepSeek v3 671B setup.
  • Test Script — tests/test_suites/llm/performance/dapo-deepseek-v3-64n8g.sh
    New Bash test script that configures environment variables, launches GRPO training with DeepSeek v3, converts tensorboard logs to JSON metrics, and conditionally validates mean token error metrics.
  • Test Suite Manifest — tests/test_suites/performance_h100.txt
    Added the new test script path to the GRPO H100 BF16 SYNC performance test section.
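The launch-then-validate pattern described for the test script can be sketched roughly as follows. This is a hedged illustration, not the actual script: the metrics file name, the metric key `train/mean_token_error`, and the 0.1 threshold are all made-up stand-ins, and the conversion step is faked with a literal JSON file.

```shell
#!/bin/bash
# Hypothetical sketch of the validation half of such a driver script.
# File name, metric key, and threshold below are illustrative assumptions.
METRICS_FILE=metrics.json

# Stand-in for the tensorboard-to-JSON conversion step
cat > "$METRICS_FILE" <<'EOF'
{"train/mean_token_error": 0.02}
EOF

# Conditional validation: only check when the metrics file was produced
if [ -f "$METRICS_FILE" ]; then
  python3 - "$METRICS_FILE" <<'PY'
import json
import sys

with open(sys.argv[1]) as f:
    metrics = json.load(f)
err = metrics["train/mean_token_error"]
assert err < 0.1, f"mean token error too high: {err}"
print("validation ok")
PY
fi
```

The real script presumably uses the repository's own converter and launches training with `uv run` first; only the shape of the conditional check is shown here.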

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~20 minutes

  • YAML configuration validation: Verify parameter correctness, Megatron/tensor parallelism settings, batch sizing, and sequence length configurations
  • Bash script logic: Review environment variable setup, configuration options passed to GRPO runner, metric validation thresholds, and conditional test execution
  • Test suite integration: Confirm proper placement and formatting in performance test manifest

Possibly related PRs

Suggested labels

r0.5.0

Suggested reviewers

  • guyueh1
  • terrykong

Pre-merge checks and finishing touches

❌ Failed checks (1 inconclusive)
  • Test Results For Major Changes — ❓ Inconclusive. Explanation: Unable to assess PR changes and test documentation without access to specific PR details, repository context, or pull request information. Resolution: Provide PR number, repository name, or direct access to PR description and changed files for assessment.
✅ Passed checks (3 passed)
  • Description Check — ✅ Passed. Check skipped: CodeRabbit's high-level summary is enabled.
  • Title check — ✅ Passed. The title accurately describes the main change: adding a DAPO recipe configuration and corresponding test file for DeepSeek v3.
  • Docstring Coverage — ✅ Passed. No functions found in the changed files to evaluate docstring coverage; the check was skipped.

📜 Recent review details

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 31a4a72 and 29f981d.

📒 Files selected for processing (3)
  • examples/configs/recipes/llm/performance/dapo-deepseek-v3-64n8g.yaml
  • tests/test_suites/llm/performance/dapo-deepseek-v3-64n8g.sh
  • tests/test_suites/performance_h100.txt
🧰 Additional context used
📓 Path-based instructions (5)
examples/configs/recipes/**/*.yaml

📄 CodeRabbit inference engine (CODING_GUIDELINES.md)

When adding support for a new model, create a recipe YAML under examples/configs/recipes/ in the appropriate domain subdirectory (llm, vlm, etc.)

Files:

  • examples/configs/recipes/llm/performance/dapo-deepseek-v3-64n8g.yaml
!(**/tests/**|**/test_*.py|**/test_*.sh)

📄 CodeRabbit inference engine (CODING_GUIDELINES.md)

Add the NVIDIA copyright header to all Python files and shell scripts (excluding tests). The header should include the current year

Files:

  • examples/configs/recipes/llm/performance/dapo-deepseek-v3-64n8g.yaml
  • tests/test_suites/llm/performance/dapo-deepseek-v3-64n8g.sh
  • tests/test_suites/performance_h100.txt
**/*.sh

📄 CodeRabbit inference engine (CODING_GUIDELINES.md)

**/*.sh: Use uv run instead of python to execute scripts
Follow the Google Shell Style Guide for shell scripts

Files:

  • tests/test_suites/llm/performance/dapo-deepseek-v3-64n8g.sh
tests/test_suites/**/*.sh

📄 CodeRabbit inference engine (CODING_GUIDELINES.md)

tests/test_suites/**/*.sh: When adding support for a new model, create a corresponding driver shell script under tests/test_suites/ in the matching domain
Driver shell scripts should match the YAML base name with .sh extension and invoke training entrypoint with uv run

Files:

  • tests/test_suites/llm/performance/dapo-deepseek-v3-64n8g.sh
**/*.{py,sh}

📄 CodeRabbit inference engine (CODING_GUIDELINES.md)

The NVIDIA copyright header should appear at the top of all Python files and shell scripts (excluding tests)

Files:

  • tests/test_suites/llm/performance/dapo-deepseek-v3-64n8g.sh
🧠 Learnings (5)
📓 Common learnings
Learnt from: CR
Repo: NVIDIA-NeMo/RL PR: 0
File: CODING_GUIDELINES.md:0-0
Timestamp: 2025-11-24T17:24:41.976Z
Learning: Applies to tests/test_suites/**/*.sh : Driver shell scripts should match the YAML base name with .sh extension and invoke training entrypoint with uv run
📚 Learning: 2025-11-24T17:24:41.976Z
Learnt from: CR
Repo: NVIDIA-NeMo/RL PR: 0
File: CODING_GUIDELINES.md:0-0
Timestamp: 2025-11-24T17:24:41.976Z
Learning: Applies to examples/configs/recipes/llm/*.yaml : Recipe YAML files should follow the naming pattern: <algo>-<model>-<nodes>n<gpus>g-<strategy-and-params>[-modifiers][-long][.vN].yaml for LLM recipes

Applied to files:

  • examples/configs/recipes/llm/performance/dapo-deepseek-v3-64n8g.yaml
📚 Learning: 2025-10-12T14:46:57.171Z
Learnt from: zpqiu
Repo: NVIDIA-NeMo/RL PR: 1324
File: tests/test_suites/llm/distillation-qwen3-32b-to-1.7b-base-1n8g-megatron-tp2pp2cp2-pack.sh:6-11
Timestamp: 2025-10-12T14:46:57.171Z
Learning: Test scripts in tests/test_suites/llm/ follow a standard configuration pattern that includes NUM_NODES, STEPS_PER_RUN, MAX_STEPS, NUM_RUNS (calculated as `$(( (MAX_STEPS + STEPS_PER_RUN - 1) / STEPS_PER_RUN ))`), and NUM_MINUTES. These variables are part of the test infrastructure's standard interface and should not be flagged as unused even if not directly referenced within the individual script, as they are consumed by external launch tooling or common.env.

Applied to files:

  • tests/test_suites/llm/performance/dapo-deepseek-v3-64n8g.sh
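The ceiling-division formula quoted in this learning can be checked with a small sketch; the step counts here are made-up example values, not the ones used by the DeepSeek v3 recipe.

```shell
# Illustrative values only; real driver scripts set these per recipe
STEPS_PER_RUN=10
MAX_STEPS=25

# Ceiling division: 25 steps at 10 steps per run requires 3 sequential runs.
# Adding (STEPS_PER_RUN - 1) before integer division rounds up.
NUM_RUNS=$(( (MAX_STEPS + STEPS_PER_RUN - 1) / STEPS_PER_RUN ))
echo "NUM_RUNS=$NUM_RUNS"   # prints NUM_RUNS=3
```

As the learning notes, `NUM_RUNS` often looks unused inside the script itself because external launch tooling or common.env consumes it, which is why ShellCheck's SC2034 warning is expected here.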
📚 Learning: 2025-10-12T14:46:55.513Z
Learnt from: zpqiu
Repo: NVIDIA-NeMo/RL PR: 1324
File: tests/test_suites/llm/distillation-qwen3-32b-to-1.7b-base-1n8g-megatron-tp2pp2cp2-pack.sh:16-30
Timestamp: 2025-10-12T14:46:55.513Z
Learning: In the NVIDIA-NeMo/RL repository, test scripts under tests/ follow a consistent pattern: use `cd $PROJECT_ROOT` without quotes or error handling, and pass arguments with `$@` unquoted. Maintain this consistency when adding new test scripts.

Applied to files:

  • tests/test_suites/llm/performance/dapo-deepseek-v3-64n8g.sh
📚 Learning: 2025-09-19T07:28:29.887Z
Learnt from: shuo-nvidia
Repo: NVIDIA-NeMo/RL PR: 1006
File: tests/test_suites/llm/distillation-qwen3-32b-to-4b-base-2n8g-fsdp2tp2-long.v1.sh:1-4
Timestamp: 2025-09-19T07:28:29.887Z
Learning: The NVIDIA-NeMo/RL project prefers to maintain consistent formatting across test scripts rather than applying individual bash hardening improvements like `set -euo pipefail` or proper quoting for sourcing files.

Applied to files:

  • tests/test_suites/llm/performance/dapo-deepseek-v3-64n8g.sh
🪛 Shellcheck (0.11.0)
tests/test_suites/llm/performance/dapo-deepseek-v3-64n8g.sh

[warning] 14-14: NUM_RUNS appears unused. Verify use (or export if used externally).

(SC2034)


[warning] 15-15: NUM_MINUTES appears unused. Verify use (or export if used externally).

(SC2034)


[warning] 21-21: Use 'cd ... || exit' or 'cd ... || return' in case cd fails.

(SC2164)


[error] 36-36: Double quote array expansions to avoid re-splitting elements.

(SC2068)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (8)
  • GitHub Check: sphinx-build / Build docs
  • GitHub Check: build-container / main
  • GitHub Check: Lint check
  • GitHub Check: Lint check
  • GitHub Check: Lint check
  • GitHub Check: Lint check
  • GitHub Check: Post submodule check comment / Comment on PR
  • GitHub Check: Post automodel integration comment / Comment on PR
🔇 Additional comments (3)
tests/test_suites/performance_h100.txt (1)

13-13: LGTM!

The new DAPO test script is correctly added to the SYNC section, consistent with the YAML configuration where async_engine: false is set.

examples/configs/recipes/llm/performance/dapo-deepseek-v3-64n8g.yaml (1)

1-91: LGTM!

The YAML configuration follows the naming convention <algo>-<model>-<nodes>n<gpus>g.yaml and is placed correctly under examples/configs/recipes/llm/performance/. The parallelism configuration (TP=8, EP=32, PP=8, CP=4) is appropriate for the 64-node × 8-GPU (512 GPU) cluster topology, and the VLLM tensor parallel size of 64 aligns with the scale. Based on learnings, the naming pattern is correct.
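The reviewer's cluster-size claim can be sanity-checked with simple arithmetic. This sketch covers only the dense parallelism product (TP × PP × CP) against the 512-GPU world size; how expert parallelism (EP=32) folds into the remaining data-parallel dimension depends on Megatron's MoE layout and is not modeled here.

```shell
# Sanity-check sketch of the dense parallelism math from the review comment
TP=8; PP=8; CP=4
NODES=64; GPUS_PER_NODE=8

WORLD=$(( NODES * GPUS_PER_NODE ))   # 64 x 8 = 512 GPUs total
DENSE=$(( TP * PP * CP ))            # 256-way dense model parallelism
DP=$(( WORLD / DENSE ))              # remainder available for data parallelism

echo "world=$WORLD dense=$DENSE dp=$DP"
```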

tests/test_suites/llm/performance/dapo-deepseek-v3-64n8g.sh (1)

1-47: LGTM!

The test script follows the repository's established patterns:

  • Uses uv run to execute Python scripts as required
  • Script name matches the YAML base name with .sh extension (per learnings)
  • Standard configuration variables (NUM_NODES, STEPS_PER_RUN, MAX_STEPS, NUM_RUNS, NUM_MINUTES) follow the test infrastructure interface
  • The cd $PROJECT_ROOT and unquoted $@ patterns are consistent with other test scripts in this repository

Regarding the static analysis hints:

  • SC2034 (NUM_RUNS/NUM_MINUTES unused): These are consumed by external launch tooling per learnings
  • SC2164 (cd error handling): Intentionally omitted for consistency with existing scripts per learnings
  • SC2068 ($@ quoting): Intentionally unquoted per repository convention


@terrykong terrykong merged commit b1a1e73 into r0.5.0 Dec 22, 2025
68 of 71 checks passed
@terrykong terrykong deleted the cherry-pick-1617-r0.5.0 branch December 22, 2025 19:51

Labels

cherry-pick · CI:L0 Run doctests and unit tests · Run CICD


3 participants