docs: Revise news section for nemotron v3 and DAPO algorithm support by snowmanwwg · Pull Request #1640 · NVIDIA-NeMo/RL

snowmanwwg · 2025-12-15T23:49:19Z

Updated the news section with Nemotron v3 and new DAPO algorithm support details, and adjusted formatting.

What does this PR do ?

Add a one line overview of what this PR aims to accomplish.

Issues

List issues that this PR closes (syntax):

Usage

You can potentially add a usage example below

# Add a code snippet demonstrating how to use this

Before your PR is "Ready for review"

Pre checks:

Make sure you read and followed Contributor guidelines
Did you write any new necessary tests?
Did you run the unit tests and functional tests locally? Visit our Testing Guide for how to run tests
Did you add or update any necessary documentation? Visit our Document Development Guide for how to write, build and test the docs.

Additional Information

...

Summary by CodeRabbit

Documentation

Added news highlighting NeMo-RL training accomplishments and new model releases.
Reorganized and consolidated news items for improved clarity.
Introduced a "Previous News" section to distinguish recent announcements from historical items.

Updated news section with nemotron v3 and new DAPO algorithm support details and adjusted formatting. Signed-off-by: Wenwen Gao <94138584+snowmanwwg@users.noreply.github.com>

coderabbitai · 2025-12-15T23:51:50Z

Walkthrough

The README.md file has been updated to reorganize and add news items. Changes include introducing a new "Previous News" collapsible section, adding a NeMo-RL training announcement, restructuring a DAPO-related entry, and moving older news items.

Changes

| Cohort / File(s) | Summary |
|---|---|
| README News Section Updates `README.md` | Added NeMo-RL Nemotron-3-Nano training announcement; consolidated 10/10/2025 DAPO entry describing extensions; created "Previous News" collapsible section; added 9/30/2025 Accelerated RL on GCP news item. |

Estimated code review effort

🎯 1 (Trivial) | ⏱️ ~5 minutes

Documentation-only changes with straightforward content reorganization and additions
Consistent formatting pattern across new news entries
No code logic or structural changes to evaluate

Possibly related PRs

cp: docs: Add news items for FP8 Quantization, MoE optimization, and NeMo-RL V0.3 (1301) into r0.4.0 #1340: overlaps on adding FP8-related news and the Nemotron announcement to the README.md News section
docs: Restructure README with backend-specific quick start and setup guides #1091: overlaps on introducing/reworking the "Previous News" collapsible block in README.md
docs: update latest news list #1390: direct overlap on editing the README.md News section and "Previous News" section structure

Suggested labels

CI:docs

Suggested reviewers

terrykong
chtruong814

Pre-merge checks and finishing touches

✅ Passed checks (4 passed)

| Check name | Status | Explanation |
|---|---|---|
| Description Check | ✅ Passed | Check skipped - CodeRabbit’s high-level summary is enabled. |
| Title check | ✅ Passed | The title accurately reflects the main changes in the pull request, which involve updating the README news section with Nemotron v3 information and DAPO algorithm details. |
| Docstring Coverage | ✅ Passed | No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check. |
| Test Results For Major Changes | ✅ Passed | Pull request contains only README.md documentation updates with no code changes, new features, breaking changes, or algorithmic modifications. |

coderabbitai

Actionable comments posted: 0

🧹 Nitpick comments (1)

README.md (1)

15-16: Address pre-existing markdown indentation violations (MD007).

The linter flags unordered-list indentation issues on lines 15, 16, and 21. These lines are not part of the current changeset, but fixing them now would improve overall Markdown compliance. By default, markdownlint's MD007 rule expects nested list items to be indented by 2 spaces, not 4.

Apply this diff to fix the indentation:

 * [12/1/2025] [Release v0.4.0!](https://github.com/NVIDIA-NeMo/RL/releases/tag/v0.4.0)
-    * First release with official NGC Container [nvcr.io/nvidia/nemo-rl:v0.4.0](https://registry.ngc.nvidia.com/orgs/nvidia/containers/nemo-rl/tags).
-    * 📊 View the release run metrics on [Google Colab](https://colab.research.google.com/drive/1u5lmjHOsYpJqXaeYstjw7Qbzvbo67U0v?usp=sharing) to get a head start on your experimentation.
+  * First release with official NGC Container [nvcr.io/nvidia/nemo-rl:v0.4.0](https://registry.ngc.nvidia.com/orgs/nvidia/containers/nemo-rl/tags).
+  * 📊 View the release run metrics on [Google Colab](https://colab.research.google.com/drive/1u5lmjHOsYpJqXaeYstjw7Qbzvbo67U0v?usp=sharing) to get a head start on your experimentation.
 * [9/27/2025] [FP8 Quantization in NeMo RL](https://github.com/NVIDIA-NeMo/RL/discussions/1216)
 * [9/25/2025] On-policy Distillation 
-    * Student generates on-policy sequences and aligns logits to a larger teacher via KL, achieving near-larger-model quality at lower cost than RL. See [On-policy Distillation](#on-policy-distillation).
+  * Student generates on-policy sequences and aligns logits to a larger teacher via KL, achieving near-larger-model quality at lower cost than RL. See [On-policy Distillation](#on-policy-distillation).

Although these violations of the markdownlint MD007 rule are pre-existing in the file, addressing them now would align the README with the tooling and prevent future lint failures.

Also applies to: 21-21
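The indentation fix suggested above can also be applied mechanically. The following is a minimal sketch, not part of the PR or of markdownlint: the helper name `fix_ul_indent` is hypothetical, and it assumes the file consistently uses 4 spaces per nesting level and contains no bullet-like lines inside fenced code blocks (those would be rewritten too).

```python
import re

# Leading spaces followed by a bullet marker ("* ", "- " or "+ ").
_BULLET = re.compile(r"^( +)([*+-] )")


def fix_ul_indent(text: str, old: int = 4, new: int = 2) -> str:
    """Rescale nested-bullet indentation from `old` to `new` spaces per level.

    Lines whose indentation is not a multiple of `old` are left untouched.
    """
    fixed = []
    for line in text.splitlines(keepends=True):
        match = _BULLET.match(line)
        if match and len(match.group(1)) % old == 0:
            depth = len(match.group(1)) // old
            # Keep everything from the bullet marker onward, including the newline.
            line = " " * (depth * new) + line[match.end(1):]
        fixed.append(line)
    return "".join(fixed)


if __name__ == "__main__":
    sample = "* [12/1/2025] Release\n    * First release with container.\n"
    print(fix_ul_indent(sample), end="")
```

Running the linter again after such a rewrite is still advisable, since the sketch does not understand Markdown structure beyond the leading spaces of a bullet.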

📜 Review details

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between a010564 and b5df2ff.

📒 Files selected for processing (1)

README.md (1 hunks)

🧰 Additional context used

📓 Path-based instructions (1)

!(**/tests/**|**/test_*.py|**/test_*.sh)

📄 CodeRabbit inference engine (CODING_GUIDELINES.md)

Add the NVIDIA copyright header to all Python files and shell scripts (excluding tests). The header should include the current year

Files:

README.md
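To illustrate how the path-based instruction above scopes the copyright-header rule, here is a rough sketch of the exclusion logic. It is an assumption about how the pattern is meant to be read, not CodeRabbit's actual matcher; the function name `needs_copyright_header` and the example paths are hypothetical.

```python
from fnmatch import fnmatchcase
from pathlib import PurePosixPath


def needs_copyright_header(path: str) -> bool:
    """Return True for Python files and shell scripts outside the test exclusions."""
    p = PurePosixPath(path)
    # The rule only concerns Python files and shell scripts.
    if p.suffix not in {".py", ".sh"}:
        return False
    # Mirrors **/tests/**: any parent directory named "tests".
    in_tests_dir = "tests" in p.parts[:-1]
    # Mirrors **/test_*.py and **/test_*.sh: file name starts with "test_".
    is_test_file = fnmatchcase(p.name, "test_*.py") or fnmatchcase(p.name, "test_*.sh")
    return not (in_tests_dir or is_test_file)


if __name__ == "__main__":
    for candidate in ["README.md", "tools/build.sh", "tests/unit/test_x.py"]:
        print(candidate, needs_copyright_header(candidate))
```

Under this reading, `README.md` is selected by the path pattern but is not subject to the header requirement, because it is neither a Python file nor a shell script.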

🪛 markdownlint-cli2 (0.18.1)

README.md

15-15: Unordered list indentation. Expected: 2; Actual: 4 (MD007, ul-indent)

16-16: Unordered list indentation. Expected: 2; Actual: 4 (MD007, ul-indent)

21-21: Unordered list indentation. Expected: 2; Actual: 4 (MD007, ul-indent)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (8)

GitHub Check: build-container / main
GitHub Check: sphinx-build / Build docs
GitHub Check: Lint check
GitHub Check: sphinx-build / Build docs
GitHub Check: build-container / main
GitHub Check: Lint check
GitHub Check: Post automodel integration comment / Comment on PR
GitHub Check: Post submodule check comment / Comment on PR

🔇 Additional comments (3)

README.md (3)

13-13: LGTM: Announce NeMo-RL's Nemotron v3 training achievement.

The new announcement effectively highlights NeMo-RL's role in training a production model and provides a direct link to reproducible code. This is a valuable addition to the news section.

17-18: LGTM: DAPO section consolidation improves readability.

Restructuring the DAPO announcement from two separate bullets into a single, comprehensive entry with linked extensions and reference documentation is an improvement. The content clearly describes the algorithm's features and provides actionable next steps via the guide link.

26-26: LGTM: Previous News section organized with GCP acceleration announcement.

Adding the 9/30/2025 GCP acceleration announcement to a collapsible "Previous News" section helps keep the active news prominent while preserving historical context. The date and link formatting align with existing patterns.

Revise news section for nemotron v3 and DAPO algorithm support

b5df2ff

Updated news section with nemotron v3 and new DAPO algorithm support details and adjusted formatting. Signed-off-by: Wenwen Gao <94138584+snowmanwwg@users.noreply.github.com>

snowmanwwg requested a review from a team as a code owner December 15, 2025 23:49

snowmanwwg had a problem deploying to nemo-ci December 15, 2025 23:49 — with GitHub Actions Error

terrykong changed the title ~~Revise news section for nemotron v3 and DAPO algorithm support~~ docs: Revise news section for nemotron v3 and DAPO algorithm support Dec 15, 2025

terrykong previously approved these changes Dec 15, 2025

terrykong added the CI:docs Run doctest label Dec 15, 2025

terrykong enabled auto-merge (squash) December 15, 2025 23:50

terrykong temporarily deployed to nemo-ci December 15, 2025 23:50 — with GitHub Actions Inactive

coderabbitai bot reviewed Dec 15, 2025

Update news section with new date for NeMo-RL

d4cede1

Signed-off-by: Wenwen Gao <94138584+snowmanwwg@users.noreply.github.com>

snowmanwwg dismissed terrykong’s stale review via d4cede1 December 15, 2025 23:52

snowmanwwg temporarily deployed to nemo-ci December 15, 2025 23:52 — with GitHub Actions Inactive

snowmanwwg temporarily deployed to nemo-ci December 15, 2025 23:56 — with GitHub Actions Inactive

terrykong approved these changes Dec 16, 2025

terrykong merged commit d4fffe0 into main Dec 16, 2025
26 checks passed

terrykong deleted the snowmanwwg-patch-4 branch December 16, 2025 05:43

coderabbitai bot mentioned this pull request Jan 31, 2026

docs: update readme post 0.5 #1856

Merged

4 tasks

seonjinn pushed a commit that referenced this pull request Mar 8, 2026

docs: Revise news section for nemotron v3 and DAPO algorithm support (#…

8ecf5b7

…1640) Signed-off-by: Wenwen Gao <94138584+snowmanwwg@users.noreply.github.com>

seonjinn pushed a commit that referenced this pull request Mar 8, 2026

docs: Revise news section for nemotron v3 and DAPO algorithm support (#…

d9bb1ff

…1640) Signed-off-by: Wenwen Gao <94138584+snowmanwwg@users.noreply.github.com>

seonjinn pushed a commit that referenced this pull request Mar 9, 2026

docs: Revise news section for nemotron v3 and DAPO algorithm support (#…

35fa9da

…1640) Signed-off-by: Wenwen Gao <94138584+snowmanwwg@users.noreply.github.com>

terrykong merged 2 commits into main from snowmanwwg-patch-4
