Skip to content

Rollback flashmla to older version [1/2]#21430

Merged
Fridge003 merged 1 commit intomainfrom
flashmla-fallback
Mar 26, 2026
Merged

Rollback flashmla to older version [1/2]#21430
Fridge003 merged 1 commit intomainfrom
flashmla-fallback

Conversation

@Fridge003
Copy link
Copy Markdown
Collaborator

Motivation

Temporarily avoid #21291

Modifications

Accuracy Tests

Benchmarking and Profiling

Checklist

Review Process

  1. Ping Merge Oncalls to start the PR flow. See the PR Merge Process.
  2. Get approvals from CODEOWNERS and other reviewers.
  3. Trigger CI tests with comments or contact authorized users to do so.
    • /tag-run-ci-label, /rerun-failed-ci, /tag-and-rerun-ci
  4. After green CI and required approvals, ask Merge Oncalls to merge.

@gemini-code-assist
Copy link
Copy Markdown
Contributor

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

@Fridge003
Copy link
Copy Markdown
Collaborator Author

/tag-and-rerun-ci

@Fridge003
Copy link
Copy Markdown
Collaborator Author

Fridge003 commented Mar 26, 2026

Results on glm-5-fp8 + DP + fp8 kv cache + flashmla kernels (with #21438 together)

Accuracy: 0.946
Invalid: 0.000
Latency: 25.141 s
Output throughput: 5439.120 token/s

it's back to normal

@Fridge003 Fridge003 changed the title Rollback flashmla to older version Rollback flashmla to older version [1/2] Mar 26, 2026
@Fridge003 Fridge003 merged commit dbe871e into main Mar 26, 2026
24 of 44 checks passed
@Fridge003 Fridge003 deleted the flashmla-fallback branch March 26, 2026 00:49
Fridge003 added a commit that referenced this pull request Apr 2, 2026
satyamk7054 pushed a commit to satyamk7054/sglang that referenced this pull request Apr 3, 2026
JustinTong0323 pushed a commit to JustinTong0323/sglang that referenced this pull request Apr 7, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant