[LoRA][I] Add MOE LoRA JIT alignment kernel and tests by yushengsu-thu · Pull Request #19710 · sgl-project/sglang

yushengsu-thu · 2026-03-02T18:50:11Z

This is Part I of PR #14105, which is being split into three parts.

Add JIT-compiled CUDA kernels for MOE LoRA block size alignment:

- moe_lora_align.py: JIT wrapper for moe_lora_align_block_size
- moe_lora_align_kernel.cu: CUDA kernels for token alignment, sorting, and expert counting
- test_moe_lora_align_block_size.py: Unit tests for the alignment kernel

Made-with: Cursor

Motivation

Modifications

Accuracy Tests

Benchmarking and Profiling

Checklist

Format your code according to the Format code with pre-commit.
Add unit tests according to the Run and add unit tests.
Update documentation according to Write documentations.
Provide accuracy and speed benchmark results according to Test the accuracy and Benchmark the speed.
Follow the SGLang code style guidance.

Review Process

Ping Merge Oncalls to start the PR flow. See the PR Merge Process.
Get approvals from CODEOWNERS and other reviewers.
Trigger CI tests with comments or contact authorized users to do so.
- /tag-run-ci-label, /rerun-failed-ci, /tag-and-rerun-ci
After green CI and required approvals, ask Merge Oncalls to merge.

gemini-code-assist · 2026-03-02T18:50:15Z

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

yushengsu-thu · 2026-03-02T18:53:10Z

Co-authored-by: Jonah Bernard <jb2528@cornell.edu>
Co-authored-by: cursor[bot] <noreply@cursor.sh>

Copilot

Pull request overview

Adds a JIT-compiled CUDA implementation for MoE+LoRA token alignment (block-size padding + per-expert sorting), along with a Python wrapper and a CUDA CI unit test, as part of the larger MoE LoRA enablement work split out from #14105.

Changes:

Introduce moe_lora_align_block_size Python wrapper that JIT-loads a new CUDA kernel.
Add CUDA kernels to build a per-LoRA token mask, align counts to block_size, and sort tokens by expert.
Add a CUDA-registered pytest validating expert assignment and LoRA ownership of sorted blocks.
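The three steps listed above (mask by LoRA adapter, pad per-expert counts to a multiple of block_size, sort by expert) can be sketched in plain Python. This is only a CPU reference sketch under stated assumptions, not the kernel from this PR: the function name `align_for_lora`, its parameters, the flat slot index `token * top_k + k`, and the use of the total slot count as the padding sentinel are all illustrative choices that the PR text does not confirm.

```python
from typing import List, Tuple


def align_for_lora(
    topk_ids: List[List[int]],  # [num_tokens][top_k]: expert chosen for each slot
    token_lora: List[int],      # [num_tokens]: LoRA adapter id of each token
    lora_id: int,               # adapter whose tokens we want to lay out
    num_experts: int,
    block_size: int,
) -> Tuple[List[int], List[int], int]:
    """Hypothetical reference: returns (sorted_ids, expert_ids, num_padded)."""
    top_k = len(topk_ids[0]) if topk_ids else 0
    # Assumed sentinel: one past the last valid flat slot index.
    pad = len(topk_ids) * top_k

    # Step 1: mask. Keep only slots whose token uses this adapter,
    # and bucket the surviving flat slot indices by expert.
    buckets: List[List[int]] = [[] for _ in range(num_experts)]
    for token, experts in enumerate(topk_ids):
        if token_lora[token] != lora_id:
            continue
        for k, expert in enumerate(experts):
            buckets[expert].append(token * top_k + k)

    # Steps 2 and 3: pad each expert's bucket up to a multiple of
    # block_size, then concatenate buckets in expert order.
    sorted_ids: List[int] = []
    expert_ids: List[int] = []  # one entry per block
    for expert, slots in enumerate(buckets):
        num_blocks = -(-len(slots) // block_size)  # ceiling division
        padded_len = num_blocks * block_size
        sorted_ids.extend(slots + [pad] * (padded_len - len(slots)))
        expert_ids.extend([expert] * num_blocks)

    return sorted_ids, expert_ids, len(sorted_ids)


if __name__ == "__main__":
    ids, experts, total = align_for_lora(
        topk_ids=[[0, 1], [1, 2], [0, 2], [1, 0]],
        token_lora=[0, 1, 0, 0],
        lora_id=0,
        num_experts=3,
        block_size=2,
    )
    print(ids, experts, total)
```

In this sketch every block of `block_size` consecutive entries in `sorted_ids` belongs to exactly one expert, which is what lets a grouped GEMM process one block per expert weight. Experts with no matching tokens contribute no blocks here; whether the real kernel does the same is not stated in the PR.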

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 7 comments.

File	Description
python/sglang/jit_kernel/moe_lora_align.py	JIT loader + Python entrypoint for the new MOE LoRA alignment kernel.
python/sglang/jit_kernel/csrc/lora/moe_lora_align_kernel.cu	CUDA implementation for token masking, expert counting/padding, and sorting for MoE LoRA alignment.
python/sglang/jit_kernel/tests/test_moe_lora_align_block_size.py	CUDA CI test validating the alignment/sorting results.
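The properties the test is described as validating (expert assignment and LoRA ownership of sorted blocks) can be illustrated with a small checker. This is a hedged sketch, not the test file from the PR: `check_blocks`, its arguments, and the layout conventions (flat slot index `token * top_k + k`, padding sentinel equal to the total slot count, one `expert_ids` entry per block) are assumptions made for illustration.

```python
from typing import List


def check_blocks(
    sorted_ids: List[int],      # padded slot indices, grouped in blocks
    expert_ids: List[int],      # expert owning each block
    topk_ids: List[List[int]],  # [num_tokens][top_k] routing decisions
    token_lora: List[int],      # [num_tokens] adapter id per token
    lora_id: int,
    block_size: int,
) -> int:
    """Hypothetical checker; returns the number of real (non-padding) slots."""
    top_k = len(topk_ids[0])
    pad = len(topk_ids) * top_k  # assumed padding sentinel

    # The layout must consist of whole blocks.
    assert len(sorted_ids) == len(expert_ids) * block_size

    seen: List[int] = []
    for block, expert in enumerate(expert_ids):
        start = block * block_size
        for slot in sorted_ids[start:start + block_size]:
            if slot == pad:
                continue  # padding entries carry no token
            token, k = divmod(slot, top_k)
            # Expert assignment: the slot was routed to this block's expert.
            assert topk_ids[token][k] == expert
            # LoRA ownership: the token belongs to the adapter being laid out.
            assert token_lora[token] == lora_id
            seen.append(slot)

    # Completeness: every slot of every matching token appears exactly once.
    expected = [
        token * top_k + k
        for token in range(len(topk_ids))
        if token_lora[token] == lora_id
        for k in range(top_k)
    ]
    assert sorted(seen) == expected
    return len(seen)


if __name__ == "__main__":
    # Two tokens, top_k = 2; only token 0 uses adapter 0.
    print(check_blocks([0, 4, 1, 4], [0, 1], [[0, 1], [1, 0]], [0, 1], 0, 2))
```

A layout that places a slot in a block owned by a different expert, or that includes a token from another adapter, fails one of the assertions.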

yushengsu-thu · 2026-03-02T19:07:01Z

/tag-and-rerun-ci

yushengsu-thu · 2026-03-06T00:18:07Z

/tag-and-rerun-ci

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Co-authored-by: Jonah Bernard <96398205+Jonahcb@users.noreply.github.com>

Copilot AI review requested due to automatic review settings March 2, 2026 18:50

yushengsu-thu requested review from BBuf, DarkSharpness, HydraQYH and celve as code owners March 2, 2026 18:50

github-actions bot added the lora label Mar 2, 2026

yushengsu-thu requested a review from Fridge003 March 2, 2026 18:50

Copilot started reviewing on behalf of yushengsu-thu March 2, 2026 18:50

yushengsu-thu assigned HydraQYH, Fridge003 and DarkSharpness Mar 2, 2026

yushengsu-thu changed the title ~~Add MOE LoRA JIT alignment kernel and tests~~ [Lora] Add MOE LoRA JIT alignment kernel and tests Mar 2, 2026

yushengsu-thu changed the title ~~[Lora] Add MOE LoRA JIT alignment kernel and tests~~ [Lora][I] Add MOE LoRA JIT alignment kernel and tests Mar 2, 2026

yushengsu-thu changed the title ~~[Lora][I] Add MOE LoRA JIT alignment kernel and tests~~ [LoRA][I] Add MOE LoRA JIT alignment kernel and tests Mar 2, 2026

Merge branch 'main' into moe-lora-jit-kernel

c85ec54

Copilot AI reviewed Mar 2, 2026

github-actions bot added the run-ci label Mar 2, 2026

DarkSharpness reviewed Mar 3, 2026

python/sglang/jit_kernel/csrc/lora/moe_lora_align_kernel.cu (outdated, resolved)

DarkSharpness reviewed Mar 3, 2026

python/sglang/jit_kernel/csrc/lora/moe_lora_align_kernel.cu (outdated, resolved)

yushengsu-thu mentioned this pull request Mar 3, 2026

Development Roadmap - miles LoRA training support Q1 radixark/miles#340

Closed

25 tasks

update

90fd98d

yushengsu-thu requested a review from yuan-luo as a code owner March 6, 2026 00:08

update

589fd26

yushengsu-thu added 2 commits March 6, 2026 00:26

update

151c2f6

pre-sommit

e1c5b80

yushengsu-thu and others added 5 commits March 9, 2026 13:48

Merge branch 'main' into moe-lora-jit-kernel

6e1efd7

Apply suggestions from code review

b137a9f

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Merge branch 'main' into moe-lora-jit-kernel

f9f9662

fix nit

6f9b815

fix pre-commit

83eaada

Fridge003 approved these changes Mar 12, 2026

Fridge003 merged commit af2807e into sgl-project:main Mar 12, 2026
241 of 267 checks passed

Jonahcb mentioned this pull request Mar 16, 2026

[LoRA][III] Add LoRA support for MoE layers and enable TP #14105

Merged

6 tasks

Fridge003 merged 11 commits into sgl-project:main from yushengsu-thu:moe-lora-jit-kernel


5 participants