Skip to content

docs: Revise news section for nemotron v3 and DAPO algorithm support#1640

Merged
terrykong merged 2 commits intomainfrom
snowmanwwg-patch-4
Dec 16, 2025
Merged

docs: Revise news section for nemotron v3 and DAPO algorithm support#1640
terrykong merged 2 commits intomainfrom
snowmanwwg-patch-4

Conversation

@snowmanwwg
Copy link
Contributor

@snowmanwwg snowmanwwg commented Dec 15, 2025

Updated news section with nemotron v3 and new DAPO algorithm support details and adjusted formatting.

What does this PR do ?

Add a one line overview of what this PR aims to accomplish.

Issues

List issues that this PR closes (syntax):

Usage

  • You can potentially add a usage example below
# Add a code snippet demonstrating how to use this

Before your PR is "Ready for review"

Pre checks:

  • Make sure you read and followed Contributor guidelines
  • Did you write any new necessary tests?
  • Did you run the unit tests and functional tests locally? Visit our Testing Guide for how to run tests
  • Did you add or update any necessary documentation? Visit our Document Development Guide for how to write, build and test the docs.

Additional Information

  • ...

Summary by CodeRabbit

Documentation

  • Added news highlighting NeMo-RL training accomplishments and new model releases.
  • Reorganized and consolidated news items for improved clarity.
  • Introduced a "Previous News" section to distinguish recent announcements from historical items.

✏️ Tip: You can customize this high-level summary in your review settings.

Updated news section with nemotron v3 and  new DAPO algorithm support details and adjusted formatting.

Signed-off-by: Wenwen Gao <94138584+snowmanwwg@users.noreply.github.com>
@snowmanwwg snowmanwwg requested a review from a team as a code owner December 15, 2025 23:49
@terrykong terrykong changed the title Revise news section for nemotron v3 and DAPO algorithm support docs: Revise news section for nemotron v3 and DAPO algorithm support Dec 15, 2025
terrykong
terrykong previously approved these changes Dec 15, 2025
@terrykong terrykong added the CI:docs Run doctest label Dec 15, 2025
@terrykong terrykong enabled auto-merge (squash) December 15, 2025 23:50
@coderabbitai
Copy link
Contributor

coderabbitai bot commented Dec 15, 2025

📝 Walkthrough

Walkthrough

The README.md file has been updated to reorganize and add news items. Changes include introducing a new "Previous News" collapsible section, adding a NeMo-RL training announcement, restructuring a DAPO-related entry, and moving older news items.

Changes

Cohort / File(s) Summary
README News Section Updates
README.md
Added NeMo-RL NeMotron-3-Nano training announcement; consolidated 10/10/2025 DAPO entry describing extensions; created "Previous News" collapsible section; added 9/30/2025 Accelerated RL on GCP news item.

Estimated code review effort

🎯 1 (Trivial) | ⏱️ ~5 minutes

  • Documentation-only changes with straightforward content reorganization and additions
  • Consistent formatting pattern across new news entries
  • No code logic or structural changes to evaluate

Possibly related PRs

Suggested labels

CI:docs

Suggested reviewers

  • terrykong
  • chtruong814

Pre-merge checks and finishing touches

✅ Passed checks (4 passed)
Check name Status Explanation
Description Check ✅ Passed Check skipped - CodeRabbit’s high-level summary is enabled.
Title check ✅ Passed The title accurately reflects the main changes in the pull request, which involve updating the README news section with Nemotron v3 information and DAPO algorithm details.
Docstring Coverage ✅ Passed No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check.
Test Results For Major Changes ✅ Passed Pull request contains only README.md documentation updates with no code changes, new features, breaking changes, or algorithmic modifications.
✨ Finishing touches
🧪 Generate unit tests (beta)
  • Create PR with unit tests
  • Post copyable unit tests in a comment
  • Commit unit tests in branch snowmanwwg-patch-4

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

Comment @coderabbitai help to get the list of available commands and usage tips.

Copy link
Contributor

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 0

🧹 Nitpick comments (1)
README.md (1)

15-16: Address pre-existing markdown indentation violations (MD007).

The linter flags unordered list indentation issues on lines 15, 16, and 21. While these lines are not part of your current changeset, fixing them now would improve overall markdown compliance. Markdown best practice requires 2-space indentation for nested lists, not 4 spaces.

Apply this diff to fix the indentation:

 * [12/1/2025] [Release v0.4.0!](https://github.com/NVIDIA-NeMo/RL/releases/tag/v0.4.0)
-    * First release with official NGC Container [nvcr.io/nvidia/nemo-rl:v0.4.0](https://registry.ngc.nvidia.com/orgs/nvidia/containers/nemo-rl/tags).
-    * 📊 View the release run metrics on [Google Colab](https://colab.research.google.com/drive/1u5lmjHOsYpJqXaeYstjw7Qbzvbo67U0v?usp=sharing) to get a head start on your experimentation.
+  * First release with official NGC Container [nvcr.io/nvidia/nemo-rl:v0.4.0](https://registry.ngc.nvidia.com/orgs/nvidia/containers/nemo-rl/tags).
+  * 📊 View the release run metrics on [Google Colab](https://colab.research.google.com/drive/1u5lmjHOsYpJqXaeYstjw7Qbzvbo67U0v?usp=sharing) to get a head start on your experimentation.
 * [9/27/2025] [FP8 Quantization in NeMo RL](https://github.com/NVIDIA-NeMo/RL/discussions/1216)
 * [9/25/2025] On-policy Distillation 
-    * Student generates on-policy sequences and aligns logits to a larger teacher via KL, achieving near-larger-model quality at lower cost than RL. See [On-policy Distillation](#on-policy-distillation).
+  * Student generates on-policy sequences and aligns logits to a larger teacher via KL, achieving near-larger-model quality at lower cost than RL. See [On-policy Distillation](#on-policy-distillation).

Per markdownlint MD007 rule, while these are pre-existing in the file, addressing them now would align with the tooling and prevent future lint failures.

Also applies to: 21-21

📜 Review details

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between a010564 and b5df2ff.

📒 Files selected for processing (1)
  • README.md (1 hunks)
🧰 Additional context used
📓 Path-based instructions (1)
!(**/tests/**|**/test_*.py|**/test_*.sh)

📄 CodeRabbit inference engine (CODING_GUIDELINES.md)

Add the NVIDIA copyright header to all Python files and shell scripts (excluding tests). The header should include the current year

Files:

  • README.md
🪛 markdownlint-cli2 (0.18.1)
README.md

15-15: Unordered list indentation
Expected: 2; Actual: 4

(MD007, ul-indent)


16-16: Unordered list indentation
Expected: 2; Actual: 4

(MD007, ul-indent)


21-21: Unordered list indentation
Expected: 2; Actual: 4

(MD007, ul-indent)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (8)
  • GitHub Check: build-container / main
  • GitHub Check: sphinx-build / Build docs
  • GitHub Check: Lint check
  • GitHub Check: sphinx-build / Build docs
  • GitHub Check: build-container / main
  • GitHub Check: Lint check
  • GitHub Check: Post automodel integration comment / Comment on PR
  • GitHub Check: Post submodule check comment / Comment on PR
🔇 Additional comments (3)
README.md (3)

13-13: LGTM: Announce NeMo-RL's Nemotron v3 training achievement.

The new announcement effectively highlights NeMo-RL's role in training a production model and provides a direct link to reproducible code. This is a valuable addition to the news section.


17-18: LGTM: DAPO section consolidation improves readability.

Restructuring the DAPO announcement from two separate bullets into a single, comprehensive entry with linked extensions and reference documentation is an improvement. The content clearly describes the algorithm's features and provides actionable next steps via the guide link.


26-26: LGTM: Previous News section organized with GCP acceleration announcement.

Adding the 9/30/2025 GCP acceleration announcement to a collapsible "Previous News" section helps keep the active news prominent while preserving historical context. The date and link formatting align with existing patterns.

Signed-off-by: Wenwen Gao <94138584+snowmanwwg@users.noreply.github.com>
@terrykong terrykong merged commit d4fffe0 into main Dec 16, 2025
26 checks passed
@terrykong terrykong deleted the snowmanwwg-patch-4 branch December 16, 2025 05:43
DeL-TaiseiOzaki pushed a commit to DeL-TaiseiOzaki/RL that referenced this pull request Jan 8, 2026
…VIDIA-NeMo#1640)

Signed-off-by: Wenwen Gao <94138584+snowmanwwg@users.noreply.github.com>
parthmannan pushed a commit to parthmannan/RL that referenced this pull request Jan 15, 2026
…VIDIA-NeMo#1640)

Signed-off-by: Wenwen Gao <94138584+snowmanwwg@users.noreply.github.com>
Signed-off-by: Parth Mannan <pmannan@nvidia.com>
@coderabbitai coderabbitai bot mentioned this pull request Jan 31, 2026
4 tasks
yuanhangsu1986 pushed a commit to yuanhangsu1986/RL-Nemontron-Edge-Omni that referenced this pull request Feb 12, 2026
…VIDIA-NeMo#1640)

Signed-off-by: Wenwen Gao <94138584+snowmanwwg@users.noreply.github.com>
Signed-off-by: yuanhangs <yuanhangs@nvidia.com>
yuanhangsu1986 pushed a commit to yuanhangsu1986/RL-Nemontron-Edge-Omni that referenced this pull request Feb 21, 2026
…VIDIA-NeMo#1640)

Signed-off-by: Wenwen Gao <94138584+snowmanwwg@users.noreply.github.com>
Signed-off-by: yuanhangs <yuanhangs@nvidia.com>
seonjinn pushed a commit that referenced this pull request Mar 8, 2026
…1640)

Signed-off-by: Wenwen Gao <94138584+snowmanwwg@users.noreply.github.com>
seonjinn pushed a commit that referenced this pull request Mar 8, 2026
…1640)

Signed-off-by: Wenwen Gao <94138584+snowmanwwg@users.noreply.github.com>
seonjinn pushed a commit that referenced this pull request Mar 9, 2026
…1640)

Signed-off-by: Wenwen Gao <94138584+snowmanwwg@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CI:docs Run doctest

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants