Skip to content

fixed amd multimodal CI failures caused by refactor in #15812 #15813#16287

Merged
Kangyan-Zhou merged 3 commits intosgl-project:mainfrom
sunxxuns:test-mm-amd-ci
Jan 2, 2026
Merged

fixed amd multimodal CI failures caused by refactor in #15812 #15813#16287
Kangyan-Zhou merged 3 commits intosgl-project:mainfrom
sunxxuns:test-mm-amd-ci

Conversation

@sunxxuns
Copy link
Copy Markdown
Collaborator

@sunxxuns sunxxuns commented Jan 2, 2026

fixed the failure initially caused by #15812 , which was merged without passing mm CI on AMD (pls avoide this future on)
image
1 fixed flashinfer call, since it's only supporting nvidia gpu now, should not be on the main path for other hardwares
2 attn_mask && is_causal doesn't work for sdpa/rocm, flux actually didn't use attn_mask anyway; fixed it for amd path and added a warning
also 2-gpu is broken by #15813
for lora weight load path.

@gemini-code-assist
Copy link
Copy Markdown
Contributor

Summary of Changes

Hello @sunxxuns, I'm Gemini Code Assist1! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request serves as a dedicated test to validate the functionality of the multimodal AMD CI pipeline. By introducing a minimal change—adding a module docstring to a specific Python initialization file—the author intends to activate and monitor the relevant CI jobs. The primary goal is to ensure that the AMD-specific tests are triggered and executed correctly, providing a clear indication of the CI system's health without introducing any functional code changes to be merged.

Highlights

  • CI Test Trigger: A module docstring was added to python/sglang/multimodal_gen/__init__.py specifically to trigger multimodal AMD CI tests.
  • Purpose: This pull request is explicitly a test PR designed to verify that the multimodal AMD CI pipeline is functioning correctly.
  • Expected Outcome: The PR aims to confirm that multimodal-gen-test-1-gpu-amd and multimodal-gen-test-2-gpu-amd jobs run as expected and to identify any test failures.
  • Lifecycle: This PR is intended to be closed after CI verification and should not be merged into the main branch.

🧠 New Feature in Public Preview: You can now enable Memory to help Gemini Code Assist learn from your team's feedback. This makes future code reviews more consistent and personalized to your project's style. Click here to enable Memory in your admin console.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature Command Description
Code Review /gemini review Performs a code review for the current pull request in its current state.
Pull Request Summary /gemini summary Provides a summary of the current pull request in its current state.
Comment @gemini-code-assist Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help /gemini help Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Footnotes

  1. Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution.

@github-actions github-actions bot added the diffusion SGLang Diffusion label Jan 2, 2026
@sunxxuns sunxxuns added the run-ci label Jan 2, 2026
Copy link
Copy Markdown
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request adds a module-level docstring to python/sglang/multimodal_gen/__init__.py to trigger CI tests. The change is straightforward and serves its purpose. While this is a test PR that is not intended for merging, I've provided a suggestion to improve the docstring to be more descriptive and align with Python's best practices (PEP 257). A more detailed docstring improves code maintainability by clearly documenting the package's public API, which could be valuable for future development.

@@ -1,4 +1,6 @@
# Copied and adapted from: https://github.com/hao-ai-lab/FastVideo
"""SGLang Multimodal Generation module for diffusion models."""
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

medium

While the added docstring provides a basic summary, it could be more descriptive to improve maintainability and align better with Python's docstring conventions (PEP 257). A good package docstring should briefly explain its purpose and list the main components it exposes. This helps other developers quickly understand the package's public API.

Suggested change
"""SGLang Multimodal Generation module for diffusion models."""
"""SGLang Multimodal Generation package for diffusion models.
This package provides the core components for multimodal generation, including:
- `DiffGenerator`: The main entry point for generating images/videos.
- `PipelineConfig`: Configuration for diffusion pipelines.
- `SamplingParams`: Parameters for controlling the sampling process.
"""
References
  1. According to PEP 257, the docstring for a package (the __init__.py file) should summarize its contents and list exported modules/subpackages. In this case, where key classes are exposed as the public API, listing them in the docstring is a valuable practice that follows the spirit of the guideline to document the package's interface. (link)

@sunxxuns sunxxuns force-pushed the test-mm-amd-ci branch 23 times, most recently from 2d2e71d to bcc8792 Compare January 2, 2026 07:02
- Fix CLIP attention on ROCm: Use is_causal=True alone without attn_mask.
  Using both together causes NaN on ROCm. CUDA path unchanged.
- Add Triton fallback for FlashInfer RoPE with warning
- Add RMSNorm fallback for QK-norm with warning
- Install tvm-ffi in CI for JIT kernel support
… dependencies in `pyproject_other.toml` to match the main `pyproject.toml`.

**Summary of the fix:**

The `qwen_image_t2i_cache_dit_enabled` test requires the `cache-dit` package. While the main `pyproject.toml` already had `cache-dit==1.1.8` in its `diffusion` dependencies, the `pyproject_other.toml` (used for AMD/HIP builds) was missing it. This caused the test to fail on AMD CI unless the workaround in the CI script successfully installed it.

**Changes made:**
- Added `"cache-dit==1.1.8"` to the `diffusion` optional dependencies in `python/pyproject_other.toml`

This ensures that `cache-dit` is properly installed when users run `pip install sglang[diffusion]` on AMD platforms, making the `qwen_image_t2i_cache_dit_enabled` test work correctly.
Copy link
Copy Markdown
Collaborator

@Qiaolin-Yu Qiaolin-Yu left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Could you please fix the lint? Thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants