Conversation
📝 Walkthrough

This pull request introduces comprehensive audio evaluation capabilities: a new audio evaluator module with task-specific evaluation functions (ASR, ASR-PC, translation, CER, hallucination detection, punctuation/capitalization analysis), an AudioMetrics class for aggregating evaluation metrics, and registration of these components in the existing evaluator and metrics registries.
Sequence Diagram

```mermaid
sequenceDiagram
    participant User
    participant eval_audio as eval_audio()
    participant evaluate_sample as evaluate_sample()
    participant TaskEval as Task-Specific<br/>Evaluators
    participant FileOps as File I/O
    User->>eval_audio: cfg (AudioEvaluatorConfig)
    activate eval_audio
    eval_audio->>FileOps: Read JSONL input
    activate FileOps
    FileOps-->>eval_audio: sample[]
    deactivate FileOps
    loop For each sample
        eval_audio->>evaluate_sample: sample, config
        activate evaluate_sample
        evaluate_sample->>evaluate_sample: Inspect task_type
        alt Task: ASR
            evaluate_sample->>TaskEval: evaluate_asr(ref, hyp)
        else Task: ASR-PC
            evaluate_sample->>TaskEval: evaluate_asr_pc(ref, hyp)
        else Task: Translation
            evaluate_sample->>TaskEval: evaluate_translation(ref, hyp)
        else Task: CER
            evaluate_sample->>TaskEval: evaluate_cer(ref, hyp)
        else Task: Hallucination
            evaluate_sample->>TaskEval: evaluate_hallucination(ref, hyp, context)
        else Task: PC-Rate
            evaluate_sample->>TaskEval: evaluate_pc_rate(ref, hyp)
        end
        activate TaskEval
        TaskEval-->>evaluate_sample: metrics dict
        deactivate TaskEval
        evaluate_sample->>evaluate_sample: Augment sample<br/>(is_correct, char_rate, etc.)
        evaluate_sample-->>eval_audio: enriched sample
        deactivate evaluate_sample
    end
    eval_audio->>FileOps: Write results to JSONL
    deactivate eval_audio
    FileOps-->>User: Updated file
```
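The per-sample dispatch the diagram describes can be sketched as a table of task-specific evaluators. The snippet below is a toy stand-in: the evaluator bodies use exact match and word overlap instead of real WER/BLEU, and the signatures only mirror the diagram, not the actual module code.

```python
# Toy sketch of the per-sample dispatch shown in the diagram; the
# evaluator bodies are stand-ins, not real WER/BLEU computations.
def evaluate_asr(ref, hyp):
    return {"is_correct": ref.strip().lower() == hyp.strip().lower()}


def evaluate_translation(ref, hyp):
    overlap = len(set(ref.split()) & set(hyp.split()))
    return {"bleu_proxy": overlap / max(len(ref.split()), 1)}


TASK_EVALUATORS = {
    "asr": evaluate_asr,
    "translation": evaluate_translation,
}


def evaluate_sample(sample):
    evaluator = TASK_EVALUATORS.get(sample["task_type"])
    if evaluator is None:
        # mirrors the diagram's fallback for unknown task types
        return {**sample, "error": f"unknown task_type: {sample['task_type']}"}
    metrics = evaluator(sample["reference"], sample["generation"])
    return {**sample, **metrics}  # enriched sample, as in the diagram


sample = {"task_type": "asr", "reference": "hello world", "generation": "Hello world"}
print(evaluate_sample(sample)["is_correct"])  # True
```

A dict-based dispatch like this keeps adding a new task type to a one-line registration, which is the extensibility point the PR description emphasizes.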
Estimated code review effort: 🎯 4 (Complex) | ⏱️ ~45 minutes
Pre-merge checks and finishing touches

✅ Passed checks (3 passed)
✨ Finishing touches
🧪 Generate unit tests (beta)
Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.
Actionable comments posted: 3
🧹 Nitpick comments (5)
nemo_skills/evaluation/metrics/audio_metrics.py (3)

43-76: Consider adding a `reset` method to clear metric score lists.

The `__init__` initializes various score lists (`wer_scores`, `cer_scores`, etc.), but there's no `reset` method to clear them. If `BaseMetrics.reset()` is called, only the parent's state is reset while these lists retain stale data.

Consider adding:

```diff
+    def reset(self):
+        """Reset all metric tracking lists."""
+        super().reset()
+        self.wer_scores = []
+        self.wer_c_scores = []
+        self.wer_pc_scores = []
+        self.per_scores = []
+        self.bleu_scores = []
+        self.cer_scores = []
+        self.hallucination_scores = []
+        self.pc_rate_scores = []
+        self.punct_f1_scores = []
+        self.cap_accuracy_scores = []
+        self.char_rate_scores = []
```
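To illustrate the failure mode behind this suggestion, here is a toy base/subclass pair; the `BaseMetrics` below is a minimal hypothetical stand-in, not the real nemo_skills class.

```python
# Toy illustration of the stale-state issue; BaseMetrics here is a
# hypothetical stand-in, not the real nemo_skills class.
class BaseMetrics:
    def __init__(self):
        self.total = 0

    def reset(self):
        self.total = 0


class AudioMetricsNoReset(BaseMetrics):
    def __init__(self):
        super().__init__()
        self.wer_scores = []


class AudioMetricsWithReset(AudioMetricsNoReset):
    def reset(self):
        super().reset()        # clears the parent's state
        self.wer_scores = []   # clears this class's lists too


m = AudioMetricsNoReset()
m.wer_scores.append(0.12)
m.reset()                 # only BaseMetrics.reset runs
print(m.wer_scores)       # [0.12] -- stale score survives the reset

m2 = AudioMetricsWithReset()
m2.wer_scores.append(0.12)
m2.reset()
print(m2.wer_scores)      # []
```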
77-95: Consider moving the `re` import to module level.

The `re` module is imported inside this method, but `re` is already used at the module level in other parts of the codebase. Since this is a standard library module with minimal overhead, importing at module level improves clarity.

```diff
 import logging
+import re

 from nemo_skills.evaluation.metrics.base import BaseMetrics, as_int, as_percentage
```

Then remove the local import:

```diff
-        import re
-
         if re.search(r"\byes\b", judgement_text, re.IGNORECASE):
```
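The word-boundary pattern in that judgement check is worth a quick illustration: `\byes\b` matches "yes" only as a standalone word, so substrings like "eyes" or "yesterday" do not count as a positive judgement. A minimal sketch (the helper name `judged_yes` is hypothetical):

```python
import re

# \byes\b needs word boundaries on both sides, so substrings such as
# "eyes" or "yesterday" do not trigger a positive judgement.
def judged_yes(judgement_text: str) -> bool:  # hypothetical helper name
    return re.search(r"\byes\b", judgement_text, re.IGNORECASE) is not None


print(judged_yes("Yes, this is a hallucination."))  # True
print(judged_yes("The eyes have it."))              # False
print(judged_yes("yesterday"))                      # False
```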
346-359: Rename unused loop variable per linter recommendation.

The `benchmark_name` variable is not used in the loop body.

```diff
-    for benchmark_name, benchmark_data in benchmarks.items():
+    for _benchmark_name, benchmark_data in benchmarks.items():
```

nemo_skills/evaluation/evaluator/audio.py (2)
190-209: Avoid catching broad `Exception`.

Catching all exceptions masks specific errors (e.g., import failures vs. computation errors). Consider catching more specific exceptions from `sacrebleu`.

```diff
     try:
         import sacrebleu

         ref = [reference.strip()]
         hyp = hypothesis.strip()
         bleu = sacrebleu.sentence_bleu(hyp, ref)
         bleu_score = bleu.score / 100.0
         return {
             "bleu": bleu_score,
             "is_correct": bleu_score > 0.3,
         }
-    except Exception as e:
+    except (ImportError, ValueError, TypeError) as e:
         return {
             "bleu": 0.0,
             "is_correct": False,
             "error": str(e),
         }
```
302-316: In-place file overwrite risks data loss on failure.

If the process crashes during the write phase (lines 312-314), the original data could be corrupted. Consider writing to a temporary file first, then atomically renaming.

```diff
+import os
+import tempfile
+
 with open(jsonl_file, "rt", encoding="utf-8") as fin:
     data = [json.loads(line) for line in fin]
 ...
-with open(jsonl_file, "wt", encoding="utf-8") as fout:
-    for sample in data:
-        fout.write(json.dumps(sample) + "\n")
+# Write to a temp file first, then rename atomically
+dirname = os.path.dirname(jsonl_file) or "."
+with tempfile.NamedTemporaryFile(mode="wt", dir=dirname, delete=False, suffix=".tmp") as fout:
+    for sample in data:
+        fout.write(json.dumps(sample) + "\n")
+    temp_path = fout.name
+os.replace(temp_path, jsonl_file)
```
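The temp-file-then-rename pattern suggested above can also be packaged as a standalone helper. The sketch below is one way to get the atomic swap with `tempfile.mkstemp` plus `os.replace`; the function name `rewrite_jsonl_atomically` is hypothetical, not part of the PR.

```python
import json
import os
import tempfile


def rewrite_jsonl_atomically(path, samples):
    """Write samples to a temp file in the same directory, then swap it
    over the target with os.replace(); the original file is never left
    half-written, even if the process dies mid-write."""
    dirname = os.path.dirname(path) or "."
    fd, tmp_path = tempfile.mkstemp(dir=dirname, suffix=".tmp")
    try:
        with os.fdopen(fd, "wt", encoding="utf-8") as fout:
            for sample in samples:
                fout.write(json.dumps(sample) + "\n")
        os.replace(tmp_path, path)  # atomic rename on POSIX and Windows
    except BaseException:
        os.unlink(tmp_path)  # clean up the partial temp file
        raise


demo_path = os.path.join(tempfile.mkdtemp(), "results.jsonl")
rewrite_jsonl_atomically(demo_path, [{"id": 1, "is_correct": True}])
print(open(demo_path, encoding="utf-8").read().strip())  # {"id": 1, "is_correct": true}
```

Creating the temp file in the same directory as the target matters: `os.replace` is only guaranteed atomic within a filesystem, and the default temp directory may live on a different one.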
📜 Review details
Configuration used: Path: .coderabbit.yaml
Review profile: CHILL
Plan: Pro
📒 Files selected for processing (5)
- README.md (1 hunks)
- nemo_skills/evaluation/evaluator/__init__.py (2 hunks)
- nemo_skills/evaluation/evaluator/audio.py (1 hunks)
- nemo_skills/evaluation/metrics/audio_metrics.py (1 hunks)
- nemo_skills/evaluation/metrics/map_metrics.py (2 hunks)
🧰 Additional context used
🧠 Learnings (1)
📚 Learning: 2025-12-02T21:26:17.342Z
Learnt from: CR
Repo: NVIDIA-NeMo/Skills PR: 0
File: CONTRIBUTING.md:0-0
Timestamp: 2025-12-02T21:26:17.342Z
Learning: Follow the existing code style and conventions in the Nemo-Skills project
Applied to files:
README.md
🧬 Code graph analysis (4)
nemo_skills/evaluation/evaluator/__init__.py (1)

- nemo_skills/evaluation/evaluator/audio.py (1)
  - eval_audio (293-316)

nemo_skills/evaluation/metrics/map_metrics.py (1)

- nemo_skills/evaluation/metrics/audio_metrics.py (1)
  - AudioMetrics (43-306)

nemo_skills/evaluation/metrics/audio_metrics.py (2)

- nemo_skills/evaluation/metrics/base.py (1)
  - BaseMetrics (23-434)
- nemo_skills/utils.py (1)
  - get_logger_name (39-43)

nemo_skills/evaluation/evaluator/audio.py (2)

- nemo_skills/utils.py (2)
  - get_logger_name (39-43)
  - nested_dataclass (69-102)
- nemo_skills/evaluation/metrics/audio_metrics.py (1)
  - update (157-198)
🪛 Ruff (0.14.8)
nemo_skills/evaluation/metrics/audio_metrics.py
211-211: Loop control variable agg_mode not used within loop body
Rename unused agg_mode to _agg_mode
(B007)
346-346: Loop control variable benchmark_name not used within loop body
Rename unused benchmark_name to _benchmark_name
(B007)
nemo_skills/evaluation/evaluator/audio.py
200-203: Consider moving this statement to an else block
(TRY300)
204-204: Do not catch blind exception: Exception
(BLE001)
223-223: Unused function argument: reference
(ARG001)
223-223: PEP 484 prohibits implicit Optional
Convert to T | None
(RUF013)
277-277: zip() without an explicit strict= parameter
Add explicit value for parameter strict=
(B905)
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (2)
- GitHub Check: unit-tests
- GitHub Check: pre-commit
🔇 Additional comments (8)
README.md (1)
5-5: Minor formatting change looks fine.

This is just an extra blank line addition with no impact on content.
nemo_skills/evaluation/evaluator/__init__.py (2)
18-18: Import follows existing conventions.

The import of `eval_audio` is correctly placed and follows the existing pattern for function-based evaluators.
60-61: Registration in EVALUATOR_MAP looks correct.

The audio evaluator is properly registered with a descriptive comment, consistent with other entries in the map.
nemo_skills/evaluation/metrics/map_metrics.py (2)
22-22: Import correctly placed.

The `AudioMetrics` import maintains alphabetical ordering with other metrics imports.

51-51: Metrics registration is consistent.

The "audio" key correctly maps to `AudioMetrics` and aligns with the evaluator registration in `EVALUATOR_MAP`.

nemo_skills/evaluation/evaluator/audio.py (3)
53-87: PER calculation logic looks correct.

The Punctuation Error Rate implementation using dynamic programming for edit distance is well-structured. The handling of edge cases (both empty punctuation sequences) is appropriate.

319-394: Sample evaluation dispatch logic is well-structured.

The function handles multiple task types cleanly with appropriate fallbacks for missing generation and unknown task types. The additional char_rate metrics provide useful diagnostic information.

124-143: Preprocessing functions are well-implemented.

The lazy import of `EnglishTextNormalizer` inside `preprocess_asr_text` is appropriate since `whisper` may be an optional dependency. Both normalization functions follow consistent patterns.
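The edit-distance dynamic program that WER/PER-style metrics build on can be sketched as follows. This is the textbook Levenshtein recurrence, not the module's actual implementation, and the empty-reference convention shown is one common choice among several.

```python
def edit_distance(ref, hyp):
    """Textbook Levenshtein DP over token sequences -- the recurrence
    that WER/PER-style error rates are built on."""
    m, n = len(ref), len(hyp)
    dp = [[0] * (n + 1) for _ in range(m + 1)]
    for i in range(m + 1):
        dp[i][0] = i  # delete everything
    for j in range(n + 1):
        dp[0][j] = j  # insert everything
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            cost = 0 if ref[i - 1] == hyp[j - 1] else 1
            dp[i][j] = min(
                dp[i - 1][j] + 1,         # deletion
                dp[i][j - 1] + 1,         # insertion
                dp[i - 1][j - 1] + cost,  # substitution / match
            )
    return dp[m][n]


def error_rate(ref_tokens, hyp_tokens):
    # Convention for an empty reference varies; here both-empty -> 0.0.
    if not ref_tokens:
        return 0.0 if not hyp_tokens else 1.0
    return edit_distance(ref_tokens, hyp_tokens) / len(ref_tokens)


print(edit_distance(list("kitten"), list("sitting")))  # 3
print(round(error_rate("the cat sat".split(), "the cat sit".split()), 3))  # 0.333
```

Running the same recurrence over words gives WER, over characters gives CER, and over extracted punctuation sequences gives the PER the comment above refers to.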
gwarmstrong
left a comment
Overall this looks pretty good; just one note: please try to use the BaseEvaluator. There is an open issue to convert other evaluations (#829), but I think it will be helpful to have it here from the start, if possible.
Signed-off-by: George Zelenfroind <gzelenfroind@nvidia.com>
…ize_robustness generic for more benchmarks, update docstrings. (#1079) Signed-off-by: Grigor Nalbandyan <gnalbandyan@nvidia.com> Signed-off-by: George Zelenfroind <gzelenfroind@nvidia.com>
- Make AudioEvaluatorConfig inherit from BaseEvaluatorConfig
- Create AudioEvaluator class with async eval_single() method
- Refactor evaluate_sample() to return updates dict instead of modified sample
- Register AudioEvaluator in EVALUATOR_CLASS_MAP
- Keep eval_audio() function for backward compatibility
- Enables interleaved evaluation with inference for better GPU utilization

Signed-off-by: George Zelenfroind <gzelenfroind@nvidia.com>
Signed-off-by: George Zelenfroind <gzelenfroind@nvidia.com>
Signed-off-by: Nikolai Ludwig <nliudvig@nvidia.com>
Signed-off-by: George Armstrong <georgea@nvidia.com>
Signed-off-by: i-vainn <imoshkov@nvidia.com>
Signed-off-by: Grigor Nalbandyan <gnalbandyan@nvidia.com>
Co-authored-by: Nick Ludwig <nliudvig@nvidia.com>
Co-authored-by: George Armstrong <georgea@nvidia.com>
Co-authored-by: Ivan <imoshkov@nvidia.com>
Co-authored-by: Wojciech Prazuch <wojciechprazuch3@gmail.com>
Co-authored-by: gnalbandyan <153070076+gnalbandyan@users.noreply.github.com>
This PR is part of the bulk improvements proposed in #1072.
This one unifies all audio-related metrics.
All audio-related tasks can now be evaluated with a single evaluator, making it easier to add new datasets.
Summary by CodeRabbit