Conversation
📝 Walkthrough

Adds a None-check in the translation metrics evaluation module. If the generation value is None during update, it is replaced with an empty string before being appended to the predictions list.
Estimated code review effort: 🎯 1 (Trivial) | ⏱️ ~3 minutes

Pre-merge checks and finishing touches: ✅ Passed checks (3 passed)
Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.
Actionable comments posted: 0
🧹 Nitpick comments (1)
nemo_skills/evaluation/metrics/translation_metrics.py (1)
61-71: None-handling is correct; just confirm empty-string vs skipping behavior

Using an empty string when `generation` is `None` is a reasonable way to keep `preds`/`gts` aligned and avoid sacrebleu type errors; it will treat "no output" as a worst-case hypothesis, which is often what you want for metrics. If that's the intended semantics, this looks good as-is.

If instead you'd prefer to ignore examples with missing generations (so they don't affect BLEU at all), an alternative would be:

```diff
 for pred in predictions:
     src_lang = pred["source_language"]
     tgt_lang = pred["target_language"]
     generation = pred["generation"]
     ground_truth = pred["translation"]

-    if generation is None:
-        generation = ""
-
-    self.translation_dict[f"{src_lang}->{tgt_lang}"]["preds"].append(generation)
-    self.translation_dict[f"{src_lang}->{tgt_lang}"]["gts"].append(ground_truth)
+    if generation is not None:
+        self.translation_dict[f"{src_lang}->{tgt_lang}"]["preds"].append(generation)
+        self.translation_dict[f"{src_lang}->{tgt_lang}"]["gts"].append(ground_truth)
```

From a correctness standpoint, your current change is fine; it just encodes a particular policy about how missing outputs should influence metrics.
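For illustration, here is a minimal self-contained sketch of the empty-string policy from the diff above. The `TranslationMetrics` class here is a hypothetical stand-in (only the update logic under discussion is modeled, not the real class from `translation_metrics.py`); the dict keys and fields are taken from the snippet.

```python
from collections import defaultdict

# Hypothetical minimal stand-in for the metrics class discussed in the review;
# only the None-handling policy in update() is sketched.
class TranslationMetrics:
    def __init__(self):
        self.translation_dict = defaultdict(lambda: {"preds": [], "gts": []})

    def update(self, predictions):
        for pred in predictions:
            key = f'{pred["source_language"]}->{pred["target_language"]}'
            generation = pred["generation"]
            if generation is None:
                # Empty-string policy: a missing generation still contributes
                # a (worst-case) hypothesis, so preds/gts stay the same length.
                generation = ""
            self.translation_dict[key]["preds"].append(generation)
            self.translation_dict[key]["gts"].append(pred["translation"])

metrics = TranslationMetrics()
metrics.update([
    {"source_language": "en", "target_language": "de",
     "generation": None, "translation": "Hallo Welt"},
    {"source_language": "en", "target_language": "de",
     "generation": "Guten Tag", "translation": "Guten Tag"},
])
pair = metrics.translation_dict["en->de"]
print(pair["preds"])  # ['', 'Guten Tag'] — aligned with gts
print(pair["gts"])    # ['Hallo Welt', 'Guten Tag']
```

With the alternative skip-policy from the diff, the first example would simply be absent from both lists and would not influence BLEU at all.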
📜 Review details
Configuration used: Path: .coderabbit.yaml
Review profile: CHILL
Plan: Pro
📒 Files selected for processing (1)
- nemo_skills/evaluation/metrics/translation_metrics.py (1 hunks)
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (2)
- GitHub Check: pre-commit
- GitHub Check: unit-tests
Hi @Froxyy-dev looks good to me! Please update with
Signed-off-by: George Armstrong <georgea@nvidia.com>
Signed-off-by: Mateusz Winiarek <mwiniarek@nvidia.com>
Signed-off-by: Mateusz Winiarek <mwiniarek@nvidia.com>
84b847c to 4b7fb63
Hi @gwarmstrong, I've signed the DCO!
Signed-off-by: George Armstrong <georgea@nvidia.com>
Signed-off-by: Mateusz Winiarek <mwiniarek@nvidia.com>
Co-authored-by: George Armstrong <georgea@nvidia.com>
Signed-off-by: George Armstrong <georgea@nvidia.com>
Signed-off-by: Mateusz Winiarek <mwiniarek@nvidia.com>
Co-authored-by: George Armstrong <georgea@nvidia.com>
Signed-off-by: wasiahmad <wasiahmad@ucla.edu>
Signed-off-by: George Armstrong <georgea@nvidia.com>
Signed-off-by: Mateusz Winiarek <mwiniarek@nvidia.com>
Co-authored-by: George Armstrong <georgea@nvidia.com>
Signed-off-by: Cheng-Ping Hsieh <chsieh@nvidia.com>
Signed-off-by: George Armstrong <georgea@nvidia.com>
Signed-off-by: Mateusz Winiarek <mwiniarek@nvidia.com>
Co-authored-by: George Armstrong <georgea@nvidia.com>
Signed-off-by: dgitman <dgitman@nvidia.com>
Summary by CodeRabbit
Bug Fixes