Implement custom dataset class for ASR benchmarking by ymoslem · Pull Request #41576 · vllm-project/vllm

ymoslem · 2026-05-03T22:03:32Z

Added a new function to process a custom audio dataset for ASR benchmarking.

Purpose

This PR adds support for benchmarking ASR (Automatic Speech Recognition) models using a custom local dataset. It introduces:

- process_audio(): a utility function that normalizes audio inputs from:
  - file paths (via soundfile);
  - HuggingFace-style dicts {"array": ..., "sampling_rate": ...}; or
  - raw (array, sr) tuples.
- CustomAudioDataset: a new dataset class extending CustomDataset that loads audio samples from a JSONL file (e.g., {"prompt": "", "audio": "/path/to/audio.wav"}), processes them via process_audio(), and constructs SampleRequest objects with the audio as multi_modal_data.
- CLI support: custom_audio added as a valid --dataset-name choice, with a corresponding elif branch in get_samples().
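The three input shapes listed above can be normalized along the following lines. This is only a sketch under stated assumptions, not the PR's process_audio() implementation: the name normalize_audio is hypothetical, the samples are passed through untouched, and the real function may do more (type conversion, validation).

```python
from pathlib import Path
from typing import Any, Tuple


def normalize_audio(audio: Any) -> Tuple[Any, int]:
    """Return (samples, sampling_rate) for a dict, a tuple, or a file path.

    Hypothetical helper for illustration only.
    """
    if isinstance(audio, dict):
        # HuggingFace-style dict: {"array": ..., "sampling_rate": ...}
        return audio["array"], int(audio["sampling_rate"])
    if isinstance(audio, tuple) and len(audio) == 2:
        # Raw (array, sr) tuple
        samples, rate = audio
        return samples, int(rate)
    if isinstance(audio, (str, Path)):
        # File path: import soundfile lazily so that the other two
        # input shapes work even when soundfile is not installed.
        try:
            import soundfile as sf
        except ImportError as exc:
            raise ImportError("soundfile is required to read audio files") from exc
        samples, rate = sf.read(str(audio))
        return samples, int(rate)
    raise TypeError(f"Unsupported audio input type: {type(audio).__name__}")
```

Importing soundfile inside the path branch keeps the dict and tuple cases usable without the optional dependency.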

Latest updates:

- Added custom_audio to the --dataset-name choices, along with the CustomAudioDataset class and the process_audio function:
  - Support ASR models (Whisper tested)
  - Support multimodal (text + audio) models requiring a chat template (Qwen2-Audio tested)
- Renamed custom_mm to custom_image and CustomMMDataset to CustomImageDataset:
  - For now, both custom_mm and custom_image are accepted to keep backward compatibility.

Test Plan 1 (Whisper)

Create a sample JSONL dataset

echo '{"prompt": "", "audio": "/path/to/test.wav"}' > whisper_dataset.jsonl
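For more than one audio file, the same JSONL layout can be generated with a few lines of Python. This is a minimal sketch: write_audio_jsonl is a hypothetical helper, and the only thing taken from the PR description is the {"prompt": ..., "audio": ...} record shape.

```python
import json
from pathlib import Path


def write_audio_jsonl(audio_paths, out_path, prompt=""):
    """Write one {"prompt": ..., "audio": ...} record per line."""
    out_path = Path(out_path)
    with out_path.open("w", encoding="utf-8") as f:
        for audio_path in audio_paths:
            record = {"prompt": prompt, "audio": str(audio_path)}
            f.write(json.dumps(record) + "\n")
    return out_path
```

For example, `write_audio_jsonl(sorted(Path("/path/to").glob("*.wav")), "whisper_dataset.jsonl")` would write one record per WAV file in that (placeholder) directory.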

Start a server

vllm serve openai/whisper-tiny

Run the benchmark

Note: Run the benchmark from a second terminal window, unless you started the server in the background (for example, with nohup).

vllm bench serve \
  --model openai/whisper-tiny \
  --backend openai-audio \
  --endpoint /v1/audio/transcriptions \
  --dataset-name custom_audio \
  --dataset-path whisper_dataset.jsonl \
  --no-oversample \
  --save-result \
  --save-detailed \
  --result-filename whisper_bench.json

Test Plan 2 (Qwen2-Audio)

Create a sample JSONL dataset.

For this model, it is better to set "prompt" to the instruction the model should follow.

echo '{"prompt": "Transcribe the audio.", "audio": "/path/to/test.wav"}' > qwen_dataset.jsonl

Start a server

vllm serve Qwen/Qwen2-Audio-7B-Instruct

Run the benchmark

Note: Run the benchmark from a second terminal window, unless you started the server in the background (for example, with nohup).

vllm bench serve \
  --model Qwen/Qwen2-Audio-7B-Instruct \
  --backend openai-chat \
  --endpoint /v1/chat/completions \
  --dataset-name custom_audio \
  --dataset-path qwen_dataset.jsonl \
  --no-oversample \
  --enable-multimodal-chat \
  --save-result \
  --save-detailed \
  --result-filename qwen_bench.json

Test Result

By the end of the run:

- You should see "Serving Benchmark Result" in the benchmark output.
- If you used the --save-result and --save-detailed options, the whisper_bench.json and qwen_bench.json files should contain the results and outputs.
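A quick way to check a saved result file is to load it and list its top-level fields. The sketch below assumes only that the file holds a single JSON object. It does not assume any particular field names, and summarize_result is a hypothetical helper.

```python
import json
from pathlib import Path


def summarize_result(path):
    """Map each top-level field of a result file to the type of its value."""
    data = json.loads(Path(path).read_text(encoding="utf-8"))
    if not isinstance(data, dict):
        raise ValueError("expected the result file to hold a JSON object")
    return {key: type(value).__name__ for key, value in sorted(data.items())}
```

Calling `summarize_result("whisper_bench.json")` after the Whisper run would show which fields were written without printing the full per-request outputs.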

Essential Elements of an Effective PR Description Checklist

The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
The test plan, such as providing test command.
The test results, such as pasting the results comparison before and after, or e2e results
(Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.

Added audio processing functionality and a custom dataset class for ASR benchmarking. The new features support various audio input formats and allow for sampling from a JSONL dataset. Signed-off-by: Yasmin Moslem <48152713+ymoslem@users.noreply.github.com>

claude

Claude Code Review

This pull request is from a fork — automated review is disabled. A repository maintainer can comment @claude review to run a one-time review.

gemini-code-assist

Code Review

This pull request introduces support for custom audio datasets in the benchmarking suite by adding a CustomAudioDataset class and a process_audio utility. The review feedback highlights a potential module-level failure due to the top-level import of soundfile without an ImportError check. Additionally, the feedback identifies several inconsistencies in the CustomAudioDataset.sample method, including missing support for the skip_chat_template flag, incorrect handling of null tokenizers, and the omission of logic for the output_tokens field.

ymoslem

Automatic suggestions reviewed

ymoslem · 2026-05-03T22:23:53Z

@ywang96 Would you please review? Thanks!

The soundfile library is imported at the top level without an ImportError check. This will cause the entire datasets module to fail to load if soundfile is not installed, even for users running non-audio benchmarks. Please follow the existing pattern in this file (e.g., for pandas or datasets) by using a try...except block or placeholder module. Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Signed-off-by: Yasmin Moslem <48152713+ymoslem@users.noreply.github.com>
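The guarded import the review asks for can be sketched as follows. This is a generic try/except variant under stated assumptions, not the helper the vLLM file actually uses: optional_import and read_audio_file are hypothetical names chosen for illustration.

```python
import importlib
from types import ModuleType
from typing import Optional


def optional_import(name: str) -> Optional[ModuleType]:
    """Return the imported module, or None if it is not installed."""
    try:
        return importlib.import_module(name)
    except ImportError:
        return None


# Resolved once at module load; a missing package no longer breaks the import
# of the surrounding module for users who never touch audio datasets.
soundfile = optional_import("soundfile")


def read_audio_file(path: str):
    """Read an audio file, failing only when audio support is actually used."""
    if soundfile is None:
        raise ImportError("soundfile is required for audio datasets")
    return soundfile.read(path)
```

The error is deferred to the first call that needs the dependency, so non-audio benchmarks are unaffected.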

The sample call for custom_audio is missing the skip_chat_template argument. This prevents the --skip-chat-template CLI flag from working correctly for this dataset type, which is inconsistent with the custom dataset implementation. Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Signed-off-by: Yasmin Moslem <48152713+ymoslem@users.noreply.github.com>

ymoslem

Reviewing suggestions

Add try... except to the soundfile import, and add guards to the audio sample function Signed-off-by: Yasmin Moslem <48152713+ymoslem@users.noreply.github.com>

ymoslem

Fixed suggestions

mergify · 2026-05-04T02:48:15Z

Hi @ymoslem, the pre-commit checks have failed. Please run:

uv pip install "pre-commit>=4.5.1"
pre-commit install
pre-commit run --all-files

Then, commit the changes and push to your branch.

For future commits, pre-commit will run automatically on changed files before each commit.

Tip

Is mypy failing?

mypy is run differently in CI. If the failure is related to this check, please use the following command to run it locally:

# For mypy (substitute "3.10" with the failing version if needed)
pre-commit run --hook-stage manual mypy-3.10

mergify · 2026-05-04T16:12:05Z

Hi @ymoslem, the pre-commit checks have failed.

mergify · 2026-05-05T23:50:53Z

Hi @ymoslem, the pre-commit checks have failed.

mergify · 2026-05-07T19:34:58Z

Hi @ymoslem, the pre-commit checks have failed.

mergify · 2026-05-10T18:46:10Z

Hi @ymoslem, the pre-commit checks have failed.

- Adding "custom_audio" to the `--dataset-name` choices, CustomAudioDataset class and process_audio function:
  - Support ASR models (Whisper tested)
  - Support Multimodal (text + audio) models requiring a chat template (Qwen2-Audio tested)
- Change "custom_mm" to "custom_image" and CustomMMDataset to CustomImageDataset:
  - For now, both "custom_mm" and "custom_image" are accepted to keep backward compatibility.

Signed-off-by: Yasmin Moslem <48152713+ymoslem@users.noreply.github.com>

mergify · 2026-05-10T22:27:05Z

Hi @ymoslem, the pre-commit checks have failed.

Match changes in datasets.py Signed-off-by: Yasmin Moslem <48152713+ymoslem@users.noreply.github.com>

Signed-off-by: Yasmin Moslem <48152713+ymoslem@users.noreply.github.com>

Added a deprecation warning for 'custom_mm' dataset. Use '--dataset-name custom_image' instead Signed-off-by: Yasmin Moslem <48152713+ymoslem@users.noreply.github.com>

ymoslem

Added a deprecation warning for custom_mm. Use --dataset-name custom_image instead.
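One way to accept the old name while steering users to the new one is sketched below. It is an illustration under stated assumptions rather than the code merged in this PR: resolve_dataset_name and DEPRECATED_DATASET_NAMES are hypothetical, and only the custom_mm to custom_image mapping comes from the discussion.

```python
import warnings

# Old dataset name -> replacement (mapping taken from the PR discussion).
DEPRECATED_DATASET_NAMES = {"custom_mm": "custom_image"}


def resolve_dataset_name(name: str) -> str:
    """Map a deprecated dataset name to its replacement, warning once per call."""
    replacement = DEPRECATED_DATASET_NAMES.get(name)
    if replacement is None:
        return name
    warnings.warn(
        f"'--dataset-name {name}' is deprecated; "
        f"use '--dataset-name {replacement}' instead.",
        DeprecationWarning,
        stacklevel=2,
    )
    return replacement
```

Resolving the alias in one place means the rest of the dispatch logic only has to handle the new name.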

Signed-off-by: Yasmin Moslem <48152713+ymoslem@users.noreply.github.com>

ymoslem

Updated deprecation warning for custom_mm

DarkLight1337

Can you also update the docs? https://docs.vllm.ai/en/latest/benchmarking/cli/?h=custom_mm#custom-multimodal-dataset

Updated the documentation to reflect changes in dataset naming and usage for image datasets. Signed-off-by: Yasmin Moslem <48152713+ymoslem@users.noreply.github.com>

mergify · 2026-05-11T17:43:59Z

Documentation preview: https://vllm--41576.org.readthedocs.build/en/41576/

Added instructions for benchmarking with CustomAudioDataset, including examples for Whisper and Qwen2-Audio models. Signed-off-by: Yasmin Moslem <48152713+ymoslem@users.noreply.github.com>

mergify · 2026-05-11T22:18:08Z

Hi @ymoslem, the pre-commit checks have failed.

Signed-off-by: Yasmin Moslem <48152713+ymoslem@users.noreply.github.com>

mergify · 2026-05-11T22:25:20Z

Hi @ymoslem, the pre-commit checks have failed.

Signed-off-by: Yasmin Moslem <48152713+ymoslem@users.noreply.github.com>

mergify · 2026-05-11T22:37:55Z

Hi @ymoslem, the pre-commit checks have failed.

Signed-off-by: Yasmin Moslem <48152713+ymoslem@users.noreply.github.com>

DarkLight1337 · 2026-05-12T04:17:54Z

Thanks for your patience!

Signed-off-by: Yasmin Moslem <48152713+ymoslem@users.noreply.github.com> Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

claude Bot reviewed May 3, 2026

mergify Bot added the performance Performance-related issues label May 3, 2026

gemini-code-assist Bot reviewed May 3, 2026

Comment thread vllm/benchmarks/datasets/datasets.py Outdated

Comment thread vllm/benchmarks/datasets/datasets.py

Comment thread vllm/benchmarks/datasets/datasets.py

ymoslem commented May 3, 2026

ywang96 reviewed May 3, 2026

Comment thread vllm/benchmarks/datasets/datasets.py Outdated

ymoslem and others added 3 commits May 4, 2026 00:28

ymoslem commented May 3, 2026

Refine soundfile import and the audio sampling function

23c2252

Add try... except to the soundfile import, and add guards to the audio sample function Signed-off-by: Yasmin Moslem <48152713+ymoslem@users.noreply.github.com>

ymoslem commented May 4, 2026

Comment thread vllm/benchmarks/datasets/datasets.py

ymoslem commented May 4, 2026

ymoslem requested a review from ywang96 May 4, 2026 00:10

Merge branch 'main' into custom-audio-dataset

2f60e3a

DarkLight1337 reviewed May 4, 2026

Comment thread vllm/benchmarks/datasets/datasets.py

DarkLight1337 added the verified Run pre-commit for new contributors without triggering other tests label May 4, 2026

Merge branch 'main' into custom-audio-dataset

362bca1

Merge branch 'main' into custom-audio-dataset

2bb7766

Merge branch 'main' into custom-audio-dataset

2d8c055

Merge branch 'main' into custom-audio-dataset

cd3191a

Add CustomAudioDataset and CustomImageDataset

775ffd3

Match changes in datasets.py Signed-off-by: Yasmin Moslem <48152713+ymoslem@users.noreply.github.com>

ymoslem added 2 commits May 11, 2026 00:02

pre-commit check

23ad515

Signed-off-by: Yasmin Moslem <48152713+ymoslem@users.noreply.github.com>

Deprecate 'custom_mm' dataset name with warning

fa3c497

Added a deprecation warning for 'custom_mm' dataset. Use '--dataset-name custom_image' instead Signed-off-by: Yasmin Moslem <48152713+ymoslem@users.noreply.github.com>

ymoslem commented May 10, 2026

ymoslem requested a review from DarkLight1337 May 10, 2026 23:20

DarkLight1337 reviewed May 10, 2026

Comment thread vllm/benchmarks/datasets/datasets.py Outdated

Update deprecation warning for custom_mm dataset

08e09bc

Signed-off-by: Yasmin Moslem <48152713+ymoslem@users.noreply.github.com>

ymoslem commented May 11, 2026

Merge branch 'main' into custom-audio-dataset

395aaa5

DarkLight1337 reviewed May 11, 2026

DarkLight1337 added the ready ONLY add when PR is ready to merge/full CI is needed label May 11, 2026

ymoslem added 2 commits May 11, 2026 18:21

Merge branch 'main' into custom-audio-dataset

982be6f

Rename Custom MM to Custom Image in CLI docs

d932313

Updated the documentation to reflect changes in dataset naming and usage for image datasets. Signed-off-by: Yasmin Moslem <48152713+ymoslem@users.noreply.github.com>

DarkLight1337 reviewed May 11, 2026

Comment thread docs/benchmarking/cli.md Outdated

mergify Bot added the documentation Improvements or additions to documentation label May 11, 2026

ymoslem added 2 commits May 11, 2026 23:05

Update CLI documentation for CustomAudioDataset

f7fcabe

Added instructions for benchmarking with CustomAudioDataset, including examples for Whisper and Qwen2-Audio models. Signed-off-by: Yasmin Moslem <48152713+ymoslem@users.noreply.github.com>

Merge branch 'main' into custom-audio-dataset

69219ae

Fix formatting of model support descriptions

2804410

Signed-off-by: Yasmin Moslem <48152713+ymoslem@users.noreply.github.com>

Update cli.md

1b168ba

Signed-off-by: Yasmin Moslem <48152713+ymoslem@users.noreply.github.com>

Update datasets.py

a9bc9d4

Signed-off-by: Yasmin Moslem <48152713+ymoslem@users.noreply.github.com>

DarkLight1337 approved these changes May 12, 2026

DarkLight1337 merged commit 28ee78a into vllm-project:main May 12, 2026
39 checks passed
