[Deps] cherry-pick #25576 onto release/v0.5.12#25931
Merged
Kangyan-Zhou merged 1 commit intoMay 21, 2026
Merged
Conversation
(cherry picked from commit a449ee4)
Contributor
|
Warning You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again! |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
Cherry-pick of #25576 (commit a449ee4) onto
release/v0.5.12.Switches
nvidia-cutlass-dslto the[cu13]extra at version4.5.1, matching what is already onmain. This is needed on CUDA 13 images (e.g. GB300) where the non-[cu13]wheel ships CUDA-12.9 binaries that miss thesm_110→sm_101alias and crash FlashAttention 4 / cute kernels at runtime.Changes
python/pyproject.toml:nvidia-cutlass-dsl==4.5.0→nvidia-cutlass-dsl[cu13]==4.5.1Conflict resolution
The source commit on
mainwas a pure version bump ([cu13]==4.5.0→[cu13]==4.5.1) becausemainalready had the[cu13]extra from a prior commit. Onrelease/v0.5.12the[cu13]extra was never picked up, so the line still readnvidia-cutlass-dsl==4.5.0. Took the incoming line so this branch ends up with the same final state asmain([cu13]==4.5.1), which is the intent of the original PR title ("Use cu13 extra for nvidia cutlass dsl").Test Plan
nvidia-cutlass-dslresolves to4.5.1with thecu13extra, and that FA4 / cute kernels no longer crash onsm_110.CI States
Latest PR Test (Base): ❌ Missing
run-cilabel -- add it to run CI tests.Latest PR Test (Extra): ❌ Blocked --
run-ciis required first.