cp: `docs: Add news items for FP8 Quantization, MoE optimization, and NeMo-RL V0.3 (1301)` into `r0.4.0` by chtruong814 · Pull Request #1340 · NVIDIA-NeMo/RL

chtruong814 · 2025-10-10T22:01:32Z

beep boop [🤖]: Hi @snowmanwwg 👋,

we've cherry picked #1301 into  for you! 🚀

Please review and approve this cherry pick at your convenience!

Summary by CodeRabbit

Documentation
- Updated the News section with four new entries:
  - 9/30/2025: Accelerated RL on GCP with NeMo RL.
  - 9/27/2025: FP8 Quantization in NeMo RL.
  - 8/15/2025: Optimizing weight transfer in large MoE models by 10x.
  - 7/31/2025: NeMo-RL V0.3—scalable, performant post-training via Megatron-Core.
- Improves visibility into recent milestones and capabilities.

…-RL V0.3 (#1301) Signed-off-by: Wenwen Gao <94138584+snowmanwwg@users.noreply.github.com> Signed-off-by: Terry Kong <terrycurtiskong@gmail.com> Co-authored-by: Terry Kong <terrycurtiskong@gmail.com> Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com> Signed-off-by: NeMo Bot <nemo-bot@nvidia.com>

coderabbitai · 2025-10-10T22:05:25Z

📝 Walkthrough

Adds four new bullet entries to the README’s News section dated 9/30/2025, 9/27/2025, 8/15/2025, and 7/31/2025. No other files or public entities are modified.

Changes

| Cohort / File(s) | Summary |
|---|---|
| Docs: README News updates `README.md` | Appends four News items: Accelerated RL on GCP with NeMo RL (9/30/2025), FP8 Quantization in NeMo RL (9/27/2025), MoE weight transfer optimization (8/15/2025), and NeMo‑RL V0.3 announcement (7/31/2025). |

Estimated code review effort

🎯 1 (Trivial) | ⏱️ ~2 minutes

Possibly related PRs

docs: Restructure README with backend-specific quick start and setup guides #1091 — Also restructures README, including News/Previous News sections; overlaps with this change area.
docs: Add news items for FP8 Quantization, MoE optimization, and NeMo-RL V0.3 #1301 — Adds the same FP8, MoE optimization, and NeMo‑RL V0.3 News items to README.

Suggested labels

r0.4.0, CI:docs

Suggested reviewers

snowmanwwg
terrykong

Pre-merge checks and finishing touches

❌ Failed checks (1 warning)

| Check name | Status | Explanation | Resolution |
|---|---|---|---|
| Title Check | ⚠️ Warning | The title embeds internal cherry-pick metadata, quoted commit messages, and branch references that obscure the actual change and exceed the recommended brevity; it fails to succinctly convey that the PR simply adds new items to the News section. | Please simplify the title to focus on the core update, for example: “docs: Add news items for FP8 Quantization, MoE optimization, and NeMo-RL V0.3,” removing the “cp:” prefix, PR number, and branch details. |

✅ Passed checks (3 passed)

| Check name | Status | Explanation |
|---|---|---|
| Description Check | ✅ Passed | Check skipped - CodeRabbit’s high-level summary is enabled. |
| Docstring Coverage | ✅ Passed | No functions found in the changes. Docstring coverage check skipped. |
| Test Results For Major Changes | ✅ Passed | Based on the provided summary and PR objectives, this change only updates README.md to add four News entries and does not introduce code changes, features, refactors, or behavior that could affect numerics or performance. There are no alterations to public entities and the only review feedback concerns a Markdown formatting fix. Since the changes are purely documentation and minor in nature, test results or performance benchmarks are not required for this PR to pass this check. |


coderabbitai

Actionable comments posted: 1

🧹 Nitpick comments (1)

README.md (1)
8-11: Conform nested list indentation to markdownlint (MD007)

Indent nested list items by 2 spaces (currently 4), to satisfy tooling and keep consistency.
-    * Student generates on-policy sequences and aligns logits to a larger teacher via KL, achieving near-larger-model quality at lower cost than RL. See [On-policy Distillation](#on-policy-distillation).
+  * Student generates on-policy sequences and aligns logits to a larger teacher via KL, achieving near-larger-model quality at lower cost than RL. See [On-policy Distillation](#on-policy-distillation).
As flagged by markdownlint (MD007). [Based on static analysis hints]
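
For illustration only, the 4-to-2-space re-indentation suggested above could be applied mechanically with a small script. This is a minimal sketch, not part of markdownlint or of this PR: `reindent_nested_bullets` is a hypothetical helper, and it assumes every nesting level was written with exactly 4 spaces and that no bullet-like lines sit inside fenced code blocks.

```python
import re

# Matches leading spaces followed by an unordered-list marker (*, -, or +).
_BULLET = re.compile(r"^( +)([*+-] )")


def reindent_nested_bullets(text: str, old: int = 4, new: int = 2) -> str:
    """Rescale nested bullet indentation from `old` to `new` spaces per level.

    Hypothetical helper: lines whose indentation is not a multiple of `old`
    are left unchanged, and fenced code blocks are not treated specially.
    """
    out = []
    for line in text.splitlines():
        m = _BULLET.match(line)
        if m and len(m.group(1)) % old == 0:
            depth = len(m.group(1)) // old
            # Replace the old indentation with `new` spaces per nesting level.
            line = " " * (depth * new) + line[len(m.group(1)):]
        out.append(line)
    return "\n".join(out)
```

Top-level bullets and lines with irregular indentation pass through untouched, so the output should still be checked against the linter before committing.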

📜 Review details

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 9689bca and f90a890.

📒 Files selected for processing (1)

README.md (1 hunks)

🧰 Additional context used

🪛 markdownlint-cli2 (0.18.1)

README.md

9-9: Unordered list indentation
Expected: 2; Actual: 4

(MD007, ul-indent)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (5)

GitHub Check: Lint check
GitHub Check: Lint check
GitHub Check: Lint check
GitHub Check: Post submodule check comment / Comment on PR
GitHub Check: Post automodel integration comment / Comment on PR

🔇 Additional comments (2)

README.md (2)

7-7: LGTM on added News items

The FP8 Quantization, MoE optimization, and NeMo‑RL V0.3 entries look correct and well‑linked.

Also applies to: 10-11

6-11: Confirm cherry-pick scope
The PR objective lists three entries (FP8 Quantization 9/27, MoE optimization 8/15, NeMo-RL V0.3 7/31), but README also includes the GCP RL item (9/30). Was this extra entry intended for r0.4.0?


chtruong814 requested a review from a team as a code owner October 10, 2025 22:01

chtruong814 requested a review from snowmanwwg October 10, 2025 22:01

chtruong814 added cherry-pick Run CICD labels Oct 10, 2025

chtruong814 temporarily deployed to nemo-ci October 10, 2025 22:01 — with GitHub Actions Inactive

chtruong814 temporarily deployed to nemo-ci October 10, 2025 22:05 — with GitHub Actions Inactive

coderabbitai bot reviewed Oct 10, 2025


chtruong814 temporarily deployed to nemo-ci October 10, 2025 22:05 — with GitHub Actions Inactive

chtruong814 temporarily deployed to nemo-ci October 10, 2025 22:09 — with GitHub Actions Inactive

Update README.md

164f5d2

Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com> Signed-off-by: Terry Kong <terrycurtiskong@gmail.com>

terrykong approved these changes Oct 10, 2025


terrykong temporarily deployed to nemo-ci October 10, 2025 23:35 — with GitHub Actions Inactive

terrykong added the CI:docs Run doctest label Oct 10, 2025

terrykong enabled auto-merge (squash) October 10, 2025 23:36

terrykong temporarily deployed to nemo-ci October 10, 2025 23:36 — with GitHub Actions Inactive

terrykong merged commit 15b3994 into r0.4.0 Oct 10, 2025
38 of 40 checks passed

terrykong deleted the cherry-pick-1301-r0.4.0 branch October 10, 2025 23:42

coderabbitai bot mentioned this pull request Dec 15, 2025

docs: Revise news section for nemotron v3 and DAPO algorithm support #1640

Merged

4 tasks

terrykong merged 2 commits into r0.4.0 from cherry-pick-1301-r0.4.0


3 participants
