Skip to content

Add AudioBench and Librispeech-PC benchmarks for speech and audio language models#1043

Closed
Jorjeous wants to merge 23 commits intoNVIDIA-NeMo:mainfrom
Jorjeous:audiobench-benchmark
Closed

Add AudioBench and Librispeech-PC benchmarks for speech and audio language models#1043
Jorjeous wants to merge 23 commits intoNVIDIA-NeMo:mainfrom
Jorjeous:audiobench-benchmark

Conversation

@Jorjeous
Copy link
Member

Resolve conflict
Signed-off-by: George Zelenfroind gzelenfroind@nvidia.com

Add audiobench

and fix prepare.py for MMAU-pro

@melllinia melllinia self-requested a review November 14, 2025 14:21
@Jorjeous
Copy link
Member Author

This PR adding evaluation on set's from audiobench and apply's minor fix to manifest format in mmau-pro

To achieve this WER, BlUE score calculation was implemented.
As well as division on Judge | Nonjudge sets

@melllinia melllinia force-pushed the audiobench-benchmark branch from 75c51eb to 97ca5b8 Compare November 21, 2025 15:07
Jorjeous and others added 17 commits November 21, 2025 08:22
Resolve conflict
Signed-off-by: George Zelenfroind <gzelenfroind@nvidia.com>

Signed-off-by: George Zelenfroind <gzelenfroind@nvidia.com>
Signed-off-by: George Zelenfroind <gzelenfroind@nvidia.com>
Signed-off-by: George Zelenfroind <gzelenfroind@nvidia.com>
Signed-off-by: George Zelenfroind <gzelenfroind@nvidia.com>
Signed-off-by: George Zelenfroind <gzelenfroind@nvidia.com>
Signed-off-by: George Zelenfroind <gzelenfroind@nvidia.com>
Signed-off-by: Sadegh Mahdavi <smahdavi@nvidia.com>
Signed-off-by: George Zelenfroind <gzelenfroind@nvidia.com>
Signed-off-by: msamadi <msamadi@nvidia.com>
Co-authored-by: msamadi <msamadi@nvidia.com>
Signed-off-by: George Zelenfroind <gzelenfroind@nvidia.com>
Signed-off-by: mmkrtchyan <mmkrtchyan@nvidia.com>
Signed-off-by: George Zelenfroind <gzelenfroind@nvidia.com>
Signed-off-by: George Zelenfroind <gzelenfroind@nvidia.com>
Signed-off-by: George Zelenfroind <gzelenfroind@nvidia.com>
Signed-off-by: George Zelenfroind <gzelenfroind@nvidia.com>
Signed-off-by: George Zelenfroind <gzelenfroind@nvidia.com>
Signed-off-by: George Zelenfroind <gzelenfroind@nvidia.com>
Signed-off-by: George Zelenfroind <gzelenfroind@nvidia.com>
Author did not signed commit
This reverts commit ecfafd1.

Signed-off-by: George Zelenfroind <gzelenfroind@nvidia.com>
Author sign off is incorrect

This reverts commit 353c202.

Signed-off-by: George Zelenfroind <gzelenfroind@nvidia.com>
@Jorjeous Jorjeous force-pushed the audiobench-benchmark branch from 2e81d8e to 2990929 Compare November 21, 2025 16:22
Signed-off-by: George Zelenfroind <gzelenfroind@nvidia.com>
Signed-off-by: George Zelenfroind <gzelenfroind@nvidia.com>
# Judge configuration matching AudioBench official implementation
# Using Llama-3.1-70B with vllm (can be overridden in run scripts)
JUDGE_PIPELINE_ARGS = {
"model": "meta-llama/Meta-Llama-3.1-70B-Instruct",
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please try to add NVIDIA deployed model instead from this link and check if it works: https://build.nvidia.com/meta/llama-3_1-70b-instruct

Signed-off-by: George Zelenfroind <gzelenfroind@nvidia.com>
@karpnv karpnv requested a review from melllinia November 24, 2025 15:36
- Add comprehensive documentation for LibriSpeech-PC benchmark in speech-audio.md
- Fix jiwer import to be lazy (only import when needed for ASR evaluation)

Signed-off-by: mmkrtchyan <mmkrtchyan@nvidia.com>
Signed-off-by: mmkrtchyan <mmkrtchyan@nvidia.com>
@melllinia melllinia changed the title Add AudioBench benchmark for speech and audio language models Add AudioBench and Librispeech-PC benchmarks for speech and audio language models Nov 25, 2025
@gwarmstrong
Copy link
Collaborator

Closed in favor of #1060

@gwarmstrong gwarmstrong closed this Dec 8, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants