
[fix] [whisper] ensure inputs are moved to the correct device before processing.#22293

Merged
yhyang201 merged 3 commits into sgl-project:main from AgainstEntropy:fix/whisper-device
Apr 8, 2026

Conversation

@AgainstEntropy
Collaborator

Motivation

Whisper was not covered by the lazy device transfer introduced in #22038, which caused a bug where the input features may not be transferred to the correct device.
The bug can be reproduced with the manual test here: https://github.com/sgl-project/sglang/blob/main/test/manual/test_whisper_cuda_graph.py

python test/manual/test_whisper_cuda_graph.py
  • main branch

    ...
    inputs_embeds = torch.nn.functional.gelu(self.conv1(input_features))
    ...
    return F.conv1d(
    ...
    RuntimeError: Expected all tensors to be on the same device, but got weight is on cuda:0, different from other tensors on cpu (when checking argument in method wrapper_CUDA___slow_conv2d_forward)
  • this PR
    everything good
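A minimal, self-contained sketch of the failure mode reported above (module sizes and tensor names here are illustrative, not the actual Whisper encoder): a Conv1d whose weights were moved to CUDA receives input features that were left on the CPU, and PyTorch raises the same "Expected all tensors to be on the same device" error until the inputs are moved first.

```python
import torch

# Stand-in for the encoder's first conv layer; shapes are illustrative only.
conv1 = torch.nn.Conv1d(in_channels=80, out_channels=4, kernel_size=3, padding=1)
input_features = torch.randn(1, 80, 10)  # created on CPU, as in the bug report

if torch.cuda.is_available():
    conv1 = conv1.cuda()  # weights now on cuda:0, inputs still on CPU
    try:
        torch.nn.functional.gelu(conv1(input_features))
    except RuntimeError as e:
        # Raises the device-mismatch RuntimeError shown in the traceback above.
        print("device mismatch:", e)
    # The fix in this PR: move the inputs to the module's device before use.
    input_features = input_features.to(conv1.weight.device)

out = torch.nn.functional.gelu(conv1(input_features))
print(out.shape)  # torch.Size([1, 4, 10])
```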

Modifications

Accuracy Tests

Speed Tests and Profiling

Checklist

Review and Merge Process

  1. Ping Merge Oncalls to start the process. See the PR Merge Process.
  2. Get approvals from CODEOWNERS and other reviewers.
  3. Trigger CI tests with comments or contact authorized users to do so.
    • Common commands include /tag-and-rerun-ci, /tag-run-ci-label, /rerun-failed-ci
  4. After green CI and required approvals, ask Merge Oncalls or people with Write permission to merge the PR.


@yhyang201
Collaborator

/tag-and-rerun-ci

@github-actions github-actions bot added the run-ci label Apr 8, 2026
        position_ids: torch.Tensor,
        forward_batch: ForwardBatch,
    ):
        device = self.conv1.weight.device
Collaborator


nit: device = next(self.parameters()).device is more robust
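A hedged sketch of the two device-lookup idioms being compared, on a toy encoder stand-in (the class and layer names here are illustrative, not the actual Whisper model). Reading the device off a specific submodule's weight works, but the reviewer's suggestion of taking it from any parameter survives refactors that rename or replace that submodule:

```python
import torch

class ToyEncoder(torch.nn.Module):
    """Illustrative stand-in for an encoder with a leading conv layer."""

    def __init__(self):
        super().__init__()
        self.conv1 = torch.nn.Conv1d(80, 16, kernel_size=3, padding=1)

    def forward(self, input_features: torch.Tensor) -> torch.Tensor:
        # Idiom used in the PR: read the device off one submodule's weight.
        device = self.conv1.weight.device
        # Reviewer's suggestion: any parameter works, so this line does not
        # break if conv1 is later renamed or restructured.
        device = next(self.parameters()).device
        input_features = input_features.to(device)
        return torch.nn.functional.gelu(self.conv1(input_features))

enc = ToyEncoder()
out = enc(torch.randn(1, 80, 10))
print(out.shape)  # torch.Size([1, 16, 10])
```

Both idioms return the same device when the whole module lives on one device, which is the case after a plain `.cuda()` or `.to(device)` call on the model.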

JustinTong0323 added a commit to JustinTong0323/sglang that referenced this pull request Apr 8, 2026
Cherry-picked from sgl-project#22293. Ensures input features and position IDs are
moved to the correct device before encoder processing.
@yhyang201
Collaborator

All CUDA CI passed.

@yhyang201 yhyang201 merged commit ae8da14 into sgl-project:main Apr 8, 2026
441 of 527 checks passed
