Skip to content

[Fix] fix device orientation for image processor#15859

Merged
Fridge003 merged 11 commits intosgl-project:mainfrom
ZailiWang:cpu_mm_fix
Jan 22, 2026
Merged

[Fix] fix device orientation for image processor#15859
Fridge003 merged 11 commits intosgl-project:mainfrom
ZailiWang:cpu_mm_fix

Conversation

@ZailiWang
Copy link
Contributor

Motivation

In BaseMultimodalProcessor::process_mm_data, the device was set as cuda or npu in a previous PR, leading to error for other backends.

Modifications

Corrected for cpu backend. Other backends may need to do the same to enable the image processor.
The installation of torchaudio is piggy-backed, as latest SGL would require it for multimodal execution.

Accuracy Tests

N/A

Benchmarking and Profiling

N/A

Checklist

@gemini-code-assist
Copy link
Contributor

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

@github-actions github-actions bot added the documentation Improvements or additions to documentation label Dec 26, 2025
@ZailiWang
Copy link
Contributor Author

/tag-run-ci-label

@mingfeima
Copy link
Collaborator

@blzheng take a look.

@JustinTong0323
Copy link
Collaborator

/tag-and-rerun-ci

@ZailiWang
Copy link
Contributor Author

Hi @JustinTong0323 would you please help merge the PR? I have resolved the conflicts.

@ZailiWang
Copy link
Contributor Author

The CI failures are all timeouts and should be irrelevant to the change in this PR.

@ZailiWang
Copy link
Contributor Author

Hi @JustinTong0323 would you help merge the PR? Thanks.

@Fridge003 Fridge003 merged commit 6a8f68b into sgl-project:main Jan 22, 2026
56 of 99 checks passed
@ZailiWang ZailiWang deleted the cpu_mm_fix branch January 22, 2026 07:05
Johnsonms pushed a commit to Johnsonms/sglang that referenced this pull request Feb 14, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

documentation Improvements or additions to documentation Multi-modal multi-modal language model run-ci

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants