[Fix] Fix extra uninstall of cutlass packages by Fridge003 · Pull Request #25756 · sgl-project/sglang

Fridge003 · 2026-05-19T07:03:04Z

Motivation

Which might fix the error mentioned in #25743

Modifications

Accuracy Tests

Speed Tests and Profiling

Checklist

Format your code according to the Format code with pre-commit.
Add unit tests according to the Run and add unit tests.
Update documentation according to Write documentations.
Provide accuracy and speed benchmark results according to Test the accuracy and Benchmark the speed.
Follow the SGLang code style guidance.

Review and Merge Process

Ping Merge Oncalls to start the process. See the PR Merge Process.
Get approvals from CODEOWNERS and other reviewers.
Trigger CI tests with comments or contact authorized users to do so.
- Common commands include /tag-and-rerun-ci, /tag-run-ci-label, /rerun-failed-ci
After green CI and required approvals, ask Merge Oncalls or people with Write permission to merge the PR.

CI States

Latest PR Test (Base): ✅ Run #26087085502
Latest PR Test (Extra): ⚠️ Not enabled -- add run-ci-extra label to opt in.

Fridge003 · 2026-05-19T07:03:26Z

/rerun-test test/registered/lora/test_lora_qwen3_8b_logprob_diff.py

Fridge003 · 2026-05-19T07:03:34Z

/rerun-test test/registered/attention/test_flash_attention_4.py

github-actions · 2026-05-19T07:03:46Z

🚀 1-gpu-h100 (1 test): ❌ View workflow run

cd test/ && python3 registered/lora/test_lora_qwen3_8b_logprob_diff.py

github-actions · 2026-05-19T07:03:59Z

🚀 4-gpu-b200 (1 test): ❌ View workflow run

cd test/ && python3 registered/attention/test_flash_attention_4.py

gemini-code-assist

Code Review

This pull request updates the flash-attn-4 dependency in pyproject.toml by adding the [cu13] extra. The reviewer recommended pinning this dependency to a specific version rather than leaving it unconstrained to ensure build stability and reproducibility.

)" This reverts commit b79e4b1.

Fridge003 · 2026-05-19T08:48:28Z

/rerun-test test/registered/lora/test_lora_qwen3_8b_logprob_diff.py

github-actions · 2026-05-19T08:48:52Z

🚀 1-gpu-h100 (1 test): ❌ View workflow run

cd test/ && python3 registered/lora/test_lora_qwen3_8b_logprob_diff.py

Fridge003 · 2026-05-19T08:58:48Z

/rerun-test test/registered/lora/test_lora_qwen3_8b_logprob_diff.py

github-actions · 2026-05-19T08:59:12Z

🚀 1-gpu-h100 (1 test): ✅ View workflow run

cd test/ && python3 registered/lora/test_lora_qwen3_8b_logprob_diff.py

Fridge003 · 2026-05-19T09:29:50Z

/tag-and-rerun-ci

PR sgl-project#25576 bumped nvidia-cutlass-dsl[cu13] from 4.5.0 to 4.5.1. The bump exposed a latent file-level conflict between -libs-base and -libs-cu13 (both written by the additive [cu13] extra) as a hard GPUModuleOp TypeError on H100: -libs-cu13's pybind11 binding changed to the new MLIR-style ((operation: object)) without a matching bump to the Python wrapper in nvidia-cutlass-dsl, so loading -libs-cu13's .so makes the wrapper's old-style super().__init__() call fail. Two changes: 1. Revert the version bump (4.5.1 -> 4.5.0). At 4.5.0 both .so files expose a compatible binding, so the same coexistence no longer crashes. This removes the active TypeError on H100 and on the CUDA-13 Docker image for non-Blackwell users. 2. Add fix_cutlass_dsl_libs() to ci_install_dependency.sh, called from main() after download_flashinfer_cache. The function picks the right libs package per GPU family even at 4.5.0 to avoid two independent regressions that the silent conflict could still hit: Blackwell (IS_BLACKWELL=1, CU13): Purge -libs-base, force-reinstall -libs-cu13 so its files take precedence. -libs-base is CUDA-12.9-built and lacks the sm_110 arch alias that GB300/B200 need at cutlass import time. Non-Blackwell CU13 (H100, H200): Purge -libs-cu13, force-reinstall -libs-base. -libs-cu13 carries a CUDBG_EXCEPTION_WARP_ILLEGAL_ADDRESS regression in LoRA CUDA- graph capture on sm_90 (sgl-project#25743 / reverted by sgl-project#25756). Non-CU13: no-op (only -libs-base ever installed).

Revert the version bump from PR sgl-project#25576. At 4.5.1, -libs-cu13's pybind11 binding changed to new MLIR-style ((operation: object)) without a matching bump to the Python wrapper in nvidia-cutlass-dsl, exposing the latent file-level conflict between -libs-base and -libs-cu13 (both written by the additive [cu13] extra) as a hard GPUModuleOp TypeError at kernel-compile time on CU13 runners. At 4.5.0 both .so files expose a compatible binding, so the same coexistence is silent and CI was empirically green on H100 and Blackwell during the post-sgl-project#25756, pre-sgl-project#25576 window. Going back to 4.5.0 restores that state. Supersedes sgl-project#25935 (which proposed the same revert but was closed).

upd

e7f9ffd

Fridge003 requested review from ispobock and merrymercy as code owners May 19, 2026 07:03

github-actions Bot added the dependencies Pull requests that update a dependency file label May 19, 2026

gemini-code-assist Bot reviewed May 19, 2026

View reviewed changes

Comment thread python/pyproject.toml Outdated

upd

910ae3b

Fridge003 changed the title ~~[Fix] Upstream FA4 package to latest version~~ [Fix] Fix extra uninstall of cutlass packages May 19, 2026

Fridge003 added 2 commits May 19, 2026 01:47

Revert "[Fix] Try to fix error caused by latest cutedsl packages (#25690

56ff529

)" This reverts commit b79e4b1.

upd

f040d0b

add

70683f5

github-actions Bot added the run-ci label May 19, 2026

Fridge003 merged commit cd012ad into main May 19, 2026
176 of 180 checks passed

Fridge003 deleted the fix-fa4 branch May 19, 2026 17:01

hnyls2002 mentioned this pull request May 19, 2026

verify_done: wait not synchronize #25465

Merged

Kangyan-Zhou mentioned this pull request May 21, 2026

[Revert] nvidia-cutlass-dsl[cu13] 4.5.1 -> 4.5.0 #25938

Merged

Kangyan-Zhou mentioned this pull request May 21, 2026

[CI] Force-reinstall nvidia-cutlass-dsl-libs-cu13 last to avoid wheel-mix TypeError #25958

Merged

Shunkangz pushed a commit to Shunkangz/sglang that referenced this pull request May 27, 2026

[Fix] Fix extra uninstall of cutlass packages (sgl-project#25756)

5b74de3

alphabetc1 pushed a commit to alphabetc1/sglang that referenced this pull request Jun 4, 2026

[Fix] Fix extra uninstall of cutlass packages (sgl-project#25756)

1886302

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Fix] Fix extra uninstall of cutlass packages#25756

[Fix] Fix extra uninstall of cutlass packages#25756
Fridge003 merged 5 commits into
mainfrom
fix-fa4

Fridge003 commented May 19, 2026 •

edited by github-actions Bot

Loading

Uh oh!

Fridge003 commented May 19, 2026

Uh oh!

Fridge003 commented May 19, 2026

Uh oh!

github-actions Bot commented May 19, 2026 •

edited

Loading

Uh oh!

github-actions Bot commented May 19, 2026 •

edited

Loading

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

Uh oh!

Fridge003 commented May 19, 2026

Uh oh!

github-actions Bot commented May 19, 2026 •

edited

Loading

Uh oh!

Fridge003 commented May 19, 2026

Uh oh!

github-actions Bot commented May 19, 2026 •

edited

Loading

Uh oh!

Fridge003 commented May 19, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

Fridge003 commented May 19, 2026 • edited by github-actions Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation

Modifications

Accuracy Tests

Speed Tests and Profiling

Checklist

Review and Merge Process

CI States

Uh oh!

Fridge003 commented May 19, 2026

Uh oh!

Fridge003 commented May 19, 2026

Uh oh!

github-actions Bot commented May 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions Bot commented May 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Fridge003 commented May 19, 2026

Uh oh!

github-actions Bot commented May 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Fridge003 commented May 19, 2026

Uh oh!

github-actions Bot commented May 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Fridge003 commented May 19, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Fridge003 commented May 19, 2026 •

edited by github-actions Bot

Loading

github-actions Bot commented May 19, 2026 •

edited

Loading

github-actions Bot commented May 19, 2026 •

edited

Loading

github-actions Bot commented May 19, 2026 •

edited

Loading

github-actions Bot commented May 19, 2026 •

edited

Loading