[misc] refactor deep_gemm compiler for new interface #6194

Alcanderian · 2025-05-11T09:08:04Z

Motivation

DeepSeek-V3-0324 gsm8k accuracy: 0.951

After this upgrade, the compilation speed of both NVCC and NVRTC is truly impressive! First, NVRTC takes about 1s per kernel to compile, while NVCC's compilation time has improved from the previous 4s per kernel to 1.2s~1.3s per kernel. Finally, we no longer have to endure long waits for precompilation!

Refer to deepseek-ai/DeepGEMM@d75b218
NVRTC may have performance loss with some cases and NVCC JIT speed is also 4x faster now. So I keep using NVCC here.

@zhyncs Dependency pipeline:

merge chore: upgrade deepgemm #6073
release new version of sgl-kernel and upload wheel to pip
update srt's sgl-kernel tag
merge this PR

Modifications

Checklist

Format your code according to the Code Formatting with Pre-Commit.
Add unit tests as outlined in the Running Unit Tests.
Update documentation / docstrings / example tutorials as needed, according to Writing Documentation.
Provide throughput / latency benchmark results and accuracy evaluation results as needed, according to Benchmark and Profiling and Accuracy Results.
For reviewers: If you haven't made any contributions to this PR and are only assisting with merging the main branch, please remove yourself as a co-author when merging the PR.
Please feel free to join our Slack channel at https://slack.sglang.ai to discuss your PR.

Alcanderian · 2025-05-11T11:44:58Z

merged into #6196

[misc] refactor deep_gemm compiler for new interface

1f730f9

zhyncs marked this pull request as ready for review May 11, 2025 10:15

zhyncs requested review from HaiShaw, Ying1123, ch-wan, ispobock, merrymercy and zhyncs as code owners May 11, 2025 10:15

Merge branch 'main' into new-deep-gemm-1

f8e0ee6

Alcanderian mentioned this pull request May 11, 2025

chore: upgrade sgl-kernel v0.1.2.post1 #6196

Merged

6 tasks

Alcanderian closed this May 11, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[misc] refactor deep_gemm compiler for new interface #6194

[misc] refactor deep_gemm compiler for new interface #6194

Uh oh!

Alcanderian commented May 11, 2025 •

edited

Loading

Uh oh!

Alcanderian commented May 11, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[misc] refactor deep_gemm compiler for new interface #6194

[misc] refactor deep_gemm compiler for new interface #6194

Uh oh!

Conversation

Alcanderian commented May 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation

Modifications

Checklist

Uh oh!

Alcanderian commented May 11, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Alcanderian commented May 11, 2025 •

edited

Loading