fixed amd multimodal CI failures caused by refactor in #15812 #15813 by sunxxuns · Pull Request #16287 · sgl-project/sglang

sunxxuns · 2026-01-02T05:37:47Z

Fixes the failures initially caused by #15812, which was merged without passing the multimodal (mm) CI on AMD (please avoid this in the future).

1. Fixed the FlashInfer call: FlashInfer only supports NVIDIA GPUs for now, so it should not be on the main path for other hardware.
2. Passing attn_mask together with is_causal doesn't work for SDPA on ROCm, and Flux didn't actually use attn_mask anyway; fixed this on the AMD path and added a warning.

Also, the 2-GPU test was broken by #15813 in the LoRA weight load path.
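
A minimal sketch of the idea in item 2, not the code from this PR: on ROCm, when is_causal is requested, drop attn_mask and warn instead of passing both to SDPA. The helper name `select_sdpa_kwargs` and the `on_rocm` flag are hypothetical; in a real PyTorch build the flag could plausibly be derived from `torch.version.hip`, which is an assumption here.

```python
import warnings
from typing import Any, Dict, Optional


def select_sdpa_kwargs(
    attn_mask: Optional[Any], is_causal: bool, on_rocm: bool
) -> Dict[str, Any]:
    """Choose the mask-related keyword arguments for an SDPA call.

    Hypothetical helper: `on_rocm` is supplied by the caller.
    """
    if on_rocm and is_causal and attn_mask is not None:
        # Per the PR description, combining both does not work on ROCm,
        # so keep the causal flag and drop the explicit mask.
        warnings.warn(
            "attn_mask is ignored on ROCm when is_causal=True",
            RuntimeWarning,
            stacklevel=2,
        )
        return {"attn_mask": None, "is_causal": True}
    # Other platforms: pass the arguments through unchanged.
    return {"attn_mask": attn_mask, "is_causal": is_causal}
```

The returned dict would be splatted into the attention call, so the non-ROCm path stays exactly as it was.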

gemini-code-assist · 2026-01-02T05:37:58Z

Summary of Changes

Hello @sunxxuns, I'm Gemini Code Assist! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request serves as a dedicated test to validate the functionality of the multimodal AMD CI pipeline. By introducing a minimal change—adding a module docstring to a specific Python initialization file—the author intends to activate and monitor the relevant CI jobs. The primary goal is to ensure that the AMD-specific tests are triggered and executed correctly, providing a clear indication of the CI system's health without introducing any functional code changes to be merged.

Highlights

- CI Test Trigger: A module docstring was added to python/sglang/multimodal_gen/__init__.py specifically to trigger multimodal AMD CI tests.
- Purpose: This pull request is explicitly a test PR designed to verify that the multimodal AMD CI pipeline is functioning correctly.
- Expected Outcome: The PR aims to confirm that multimodal-gen-test-1-gpu-amd and multimodal-gen-test-2-gpu-amd jobs run as expected and to identify any test failures.
- Lifecycle: This PR is intended to be closed after CI verification and should not be merged into the main branch.

gemini-code-assist

Code Review

This pull request adds a module-level docstring to python/sglang/multimodal_gen/__init__.py to trigger CI tests. The change is straightforward and serves its purpose. While this is a test PR that is not intended for merging, I've provided a suggestion to improve the docstring to be more descriptive and align with Python's best practices (PEP 257). A more detailed docstring improves code maintainability by clearly documenting the package's public API, which could be valuable for future development.

gemini-code-assist · 2026-01-02T05:39:00Z

python/sglang/multimodal_gen/__init__.py

@@ -1,4 +1,6 @@
 # Copied and adapted from: https://github.com/hao-ai-lab/FastVideo
+"""SGLang Multimodal Generation module for diffusion models."""


While the added docstring provides a basic summary, it could be more descriptive to improve maintainability and align better with Python's docstring conventions (PEP 257). A good package docstring should briefly explain its purpose and list the main components it exposes. This helps other developers quickly understand the package's public API.

Suggested change

-"""SGLang Multimodal Generation module for diffusion models."""
+"""SGLang Multimodal Generation package for diffusion models.
+This package provides the core components for multimodal generation, including:
+- `DiffGenerator`: The main entry point for generating images/videos.
+- `PipelineConfig`: Configuration for diffusion pipelines.
+- `SamplingParams`: Parameters for controlling the sampling process.
+"""

References

According to PEP 257, the docstring for a package (the __init__.py file) should summarize its contents and list exported modules/subpackages. In this case, where key classes are exposed as the public API, listing them in the docstring is a valuable practice that follows the spirit of the guideline to document the package's interface.

- Fix CLIP attention on ROCm: Use is_causal=True alone without attn_mask. Using both together causes NaN on ROCm. CUDA path unchanged.
- Add Triton fallback for FlashInfer RoPE with warning
- Add RMSNorm fallback for QK-norm with warning
- Install tvm-ffi in CI for JIT kernel support
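
The "fallback with warning" pattern in the commit message above can be sketched roughly as follows. This is an illustration under assumptions, not the PR's implementation: `qk_norm`, `rms_norm_reference`, and the `fused_kernel` parameter are hypothetical names, and plain Python lists stand in for tensors so the sketch has no GPU dependency.

```python
import math
import warnings
from typing import Callable, List, Optional, Sequence

_warned_fallback = False  # warn only once per process


def rms_norm_reference(
    x: Sequence[float], weight: Sequence[float], eps: float = 1e-6
) -> List[float]:
    """Reference RMSNorm: x / sqrt(mean(x^2) + eps) * weight."""
    mean_sq = sum(v * v for v in x) / len(x)
    scale = 1.0 / math.sqrt(mean_sq + eps)
    return [v * scale * w for v, w in zip(x, weight)]


def qk_norm(
    x: Sequence[float],
    weight: Sequence[float],
    fused_kernel: Optional[Callable] = None,
    eps: float = 1e-6,
):
    """Use the fused kernel when one is available, else fall back and warn."""
    global _warned_fallback
    if fused_kernel is not None:
        # Fast path, e.g. a vendor-specific fused kernel (hypothetical).
        return fused_kernel(x, weight, eps)
    if not _warned_fallback:
        warnings.warn(
            "fused QK-norm kernel unavailable; using RMSNorm fallback",
            RuntimeWarning,
            stacklevel=2,
        )
        _warned_fallback = True
    return rms_norm_reference(x, weight, eps)
```

Warning once rather than on every call keeps logs readable while still making the slower path visible to whoever runs the job.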

… dependencies in `pyproject_other.toml` to match the main `pyproject.toml`.

**Summary of the fix:** The `qwen_image_t2i_cache_dit_enabled` test requires the `cache-dit` package. While the main `pyproject.toml` already had `cache-dit==1.1.8` in its `diffusion` dependencies, the `pyproject_other.toml` (used for AMD/HIP builds) was missing it. This caused the test to fail on AMD CI unless the workaround in the CI script successfully installed it.

**Changes made:**
- Added `"cache-dit==1.1.8"` to the `diffusion` optional dependencies in `python/pyproject_other.toml`

This ensures that `cache-dit` is properly installed when users run `pip install sglang[diffusion]` on AMD platforms, making the `qwen_image_t2i_cache_dit_enabled` test work correctly.
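
One way to confirm locally whether an optional dependency such as `cache-dit` actually ended up in an environment is to query the installed distribution metadata. The sketch below uses only the standard library; the helper name `installed_version` is hypothetical and is not part of sglang.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(dist_name: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if absent."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_version("cache-dit")
    if found is None:
        print("cache-dit is not installed; cache-dit tests would fail here")
    else:
        print(f"cache-dit {found} is installed")
```

A check like this could also guard a test so that it is skipped with a clear message, rather than failing on an import error, when the extra is missing.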

Qiaolin-Yu

Could you please fix the lint? Thanks!

sunxxuns requested review from mickqian and yhyang201 as code owners January 2, 2026 05:37

github-actions bot added the diffusion SGLang Diffusion label Jan 2, 2026

sunxxuns added the run-ci label Jan 2, 2026

gemini-code-assist bot reviewed Jan 2, 2026

mickqian added the DO NOT MERGE label Jan 2, 2026

sunxxuns force-pushed the test-mm-amd-ci branch 23 times, most recently from 2d2e71d to bcc8792 Compare January 2, 2026 07:02

sunxxuns force-pushed the test-mm-amd-ci branch 21 times, most recently from 267ec31 to deb59a2 Compare January 2, 2026 07:15

sunxxuns mentioned this pull request Jan 2, 2026

[Diffusion] Improve qwen image edit performace to align with LightX2V #15812

Merged

sunxxuns added 3 commits January 2, 2026 10:00

fix(amd): workaround LoRA weight_name regression from PR 15813 on ROCm

e40d7eb

Qiaolin-Yu reviewed Jan 2, 2026

This was referenced Jan 3, 2026

[Diffusion] safe fallback for fused QK-norm #16329

Closed

diffusion: rotary embedding kernel #14302

Open

ZiguanWang mentioned this pull request Jan 12, 2026

[diffusion]: align sglang diffusion AMD pyproject_other.toml diffusion dependency with pyproject.toml #16225

Merged

5 tasks

thxCode mentioned this pull request Feb 4, 2026

Error occurred when accessing the diffusion model deployed on rocm6.4-sglang0.5.7 gpustack/gpustack#4421

Closed

Kangyan-Zhou merged 3 commits into sgl-project:main from sunxxuns:test-mm-amd-ci

4 participants
