[Diffusion] Refactor fuse qkv with QKVParallelLinear linear #15090

Merged
mickqian merged 13 commits into main from restruct_fuse_qkv_with_QKVParallelLinear
Dec 14, 2025

@BBuf (Collaborator) commented Dec 14, 2025

Motivation

Following #14505, refactor the fused QKV projection to use QKVParallelLinear, to unify the code style.
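
For readers unfamiliar with the term, a fused QKV projection stacks the query, key, and value weights into one matrix, applies a single projection, and splits the result. The sketch below is a framework-free illustration in plain Python. It is not sglang's QKVParallelLinear, whose constructor and forward signature are not shown in this PR description, and every name in it (`linear`, `fused_qkv`, the toy weights) is hypothetical.

```python
# Framework-free sketch of a fused QKV projection.
# All names here are hypothetical and chosen only for illustration.

def linear(weight, x):
    """Compute weight @ x for a weight stored as a list of rows."""
    return [sum(w * v for w, v in zip(row, x)) for row in weight]


def fused_qkv(w_q, w_k, w_v, x):
    """Stack the three weights row-wise, project once, then split the result."""
    w_qkv = w_q + w_k + w_v          # list concatenation == row-wise stacking
    out = linear(w_qkv, x)           # one projection instead of three
    q_end = len(w_q)
    k_end = q_end + len(w_k)
    return out[:q_end], out[q_end:k_end], out[k_end:]


x = [1.0, 2.0]
w_q = [[1.0, 0.0], [0.0, 1.0]]
w_k = [[2.0, 0.0], [0.0, 2.0]]
w_v = [[1.0, 1.0], [1.0, -1.0]]

q, k, v = fused_qkv(w_q, w_k, w_v, x)

# The fused path matches three separate projections.
assert (q, k, v) == (linear(w_q, x), linear(w_k, x), linear(w_v, x))
```

In a tensor framework the same idea typically becomes one matrix multiplication followed by a split along the output dimension.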

sglang generate --model-path black-forest-labs/FLUX.1-dev  \
    --prompt "A logo With Bold Large text: SGL Diffusion" --tp-size 1 --height 1024 --width 1024
[image]
sglang generate --model-path black-forest-labs/FLUX.2-dev  \
    --prompt "A logo With Bold Large text: SGL Diffusion" --tp-size 1 --height 1024 --width 1024
[image]
sglang generate --model-path Qwen/Qwen-Image --prompt "A logo With Bold Large text: SGL Diffusion" 
[image]

Profile

Qwen-Image

main:

[image]

pr:

[image]

QKVParallelLinear reduces some CPU overhead, giving a speedup from 248 us to 176 us (roughly 1.4x).

Modifications

Accuracy Tests

Benchmarking and Profiling

Checklist

@github-actions bot added the documentation and diffusion labels Dec 14, 2025
@mickqian (Collaborator)

/tag-and-rerun-ci

@mickqian force-pushed the restruct_fuse_qkv_with_QKVParallelLinear branch from fe7d68d to bf34996 December 14, 2025 15:40
@mickqian merged commit 64b5c3a into main Dec 14, 2025
48 of 49 checks passed
@mickqian deleted the restruct_fuse_qkv_with_QKVParallelLinear branch December 14, 2025 16:33
@BBuf mentioned this pull request Dec 15, 2025
6 tasks
YChange01 pushed a commit to YChange01/sglang that referenced this pull request Jan 13, 2026
Labels

diffusion, documentation, run-ci