-
Notifications
You must be signed in to change notification settings - Fork 3.4k
[sgl-kernel] Support PDL for activatons #6722
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Open
Edenzzzz
wants to merge
92
commits into
sgl-project:main
Choose a base branch
from
Edenzzzz:upgrade_act
base: main
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
+103
−46
Open
Changes from 85 commits
Commits
Show all changes
92 commits
Select commit
Hold shift + click to select a range
1818b07
update to flashinfer 0.2.5
Edenzzzz c82cd7f
Merge branch 'main' into upgrade_act
Edenzzzz c6d6ebe
Update rope and bmm args
Edenzzzz f2bc110
Merge branch 'main' into upgrade_act
Edenzzzz 199a7e4
update llama4 chat template and pythonic parser (#6679)
upfixer dd79f42
feat(tool call): Enhance Llama32Detector for improved JSON parsing in…
CatherineSue d143c1e
Support token-level quantization for EP MoE (#6782)
ch-wan e13c073
Temporarily lower mmlu threshold for triton sliding window backend (#…
NorthmanPKU e622cca
ci: relax test_function_call_required (#6786)
CatherineSue 51dadba
Add intel_amx backend for Radix Attention for CPU (#6408)
yanbing-j 813e7f6
Fix incorrect LoRA weight loading for fused gate_up_proj (#6734)
lifuhuang ec8a3c9
fix(PD-disaggregation): Can not get local ip (#6792)
storyicon 4eb04d5
[FIX] mmmu bench serving result display error (#6525) (#6791)
Arist12 a5abf60
Bump torch to 2.7.0 (#6788)
Qiaolin-Yu 3e1d645
chore: bump sgl-kernel v0.1.5 (#6794)
zhyncs 5403644
Improve profiler and integrate profiler in bench_one_batch_server (#6…
merrymercy a6774a3
chore: upgrade sgl-kernel v0.1.5 (#6795)
zhyncs 9eb5162
[Minor] Always append newline after image token when parsing chat mes…
lifuhuang fc8b63b
Update CI tests for Llama4 models (#6421)
ravi03071991 0c42a31
[Feat] Enable PDL automatically on Hopper architecture (#5981)
PopSoda2002 6fd2dae
chore: update blackwell docker (#6800)
zhyncs 814601b
misc: cache is_hopper_arch (#6799)
Edenzzzz 2431560
set enable_pdl
Edenzzzz ce6fb05
Merge branch 'main' into upgrade_act
Edenzzzz 68062c5
fix
Edenzzzz ec620df
Merge branch 'main' into upgrade_act
Edenzzzz 866cfcf
Merge branch 'main' into upgrade_act
FlamingoPg 6ba7d17
Merge branch 'main' into upgrade_act
Edenzzzz 6ee99b8
Merge branch 'main' into upgrade_act
Edenzzzz 0904613
fix args
Edenzzzz 224d74e
fix
Edenzzzz 6f8195e
fix
Edenzzzz b22d672
Merge branch 'main' into upgrade_act
Edenzzzz 9891438
Merge branch 'main' into upgrade_act
Edenzzzz 9d1239d
Merge branch 'main' into upgrade_act
Edenzzzz 8c6295e
Merge branch 'main' into upgrade_act
Edenzzzz 81c6fa1
fix dtype
Edenzzzz 123b6ba
support blackwell
Edenzzzz 13f81a2
Merge branch 'main' into upgrade_act
Edenzzzz cfe7732
Merge branch 'main' into upgrade_act
Edenzzzz d56f8ff
Merge branch 'main' into upgrade_act
Fridge003 185223e
Merge branch 'main' into upgrade_act
Fridge003 7d0af1c
Merge branch 'main' into upgrade_act
Edenzzzz 67eae34
Merge branch 'main' into upgrade_act
Edenzzzz 79f4146
Merge branch 'main' into upgrade_act
zhyncs 182f046
Merge branch 'main' into upgrade_act
Edenzzzz 9b116df
Merge branch 'main' into upgrade_act
Fridge003 efc6b14
Merge branch 'main' into upgrade_act
Fridge003 6158d77
Merge branch 'main' into upgrade_act
fzyzcjy 20ab53d
Merge branch 'main' into upgrade_act
Fridge003 3cd9e1f
Merge main
Edenzzzz 17700c7
fix
Edenzzzz 12cc9e2
Merge branch 'main' into upgrade_act
Edenzzzz 547f8fd
Merge branch 'main' into upgrade_act
Edenzzzz 0ab683c
Merge branch 'main' into upgrade_act
Edenzzzz f958ca9
Merge branch 'main' into upgrade_act
Edenzzzz b65176d
fix
Edenzzzz a74ab9e
Update sgl-kernel/python/sgl_kernel/elementwise.py
Edenzzzz cf46798
Merge branch 'main' into upgrade_act
Edenzzzz 075c7f6
fix
Edenzzzz 334d797
Merge branch 'main' into upgrade_act
Edenzzzz 044e24b
Merge branch 'main' into upgrade_act
Edenzzzz 4b5835f
Merge main
Edenzzzz 2ea1ef4
fix
Edenzzzz acbfda0
Merge branch 'main' into upgrade_act
Edenzzzz 4eac47a
Merge branch 'main' into upgrade_act
Edenzzzz 7d9f812
Merge branch 'main' into upgrade_act
Fridge003 dad9d0c
Merge branch 'main' into upgrade_act
Edenzzzz bae595f
fix
Edenzzzz b3e29e4
fix
Edenzzzz aeca0d0
Merge branch 'main' into upgrade_act
Edenzzzz 3bf8ef3
fix
Edenzzzz ebb587f
Merge branch 'main' into upgrade_act
FlamingoPg ee883be
Merge branch 'main' into upgrade_act
Edenzzzz 9fc56c2
Merge branch 'main' into upgrade_act
Edenzzzz 2cc892a
Merge branch 'main' into upgrade_act
Fridge003 eda5473
Merge branch 'main' into upgrade_act
Edenzzzz 124d7f8
Merge branch 'main' into upgrade_act
Edenzzzz ff1d2d2
Merge branch 'main' into upgrade_act
Fridge003 232dfe7
Merge branch 'main' into upgrade_act
Edenzzzz a3bc9dc
Merge branch 'main' into upgrade_act
Edenzzzz 440e351
Merge branch 'main' into upgrade_act
Fridge003 9bda166
Merge branch 'main' into upgrade_act
Fridge003 c5b257a
Merge branch 'main' into upgrade_act
Edenzzzz 59bbe7e
Merge branch 'main' into upgrade_act
Fridge003 5235888
Merge branch 'main' into upgrade_act
Edenzzzz 5ff976c
Merge branch 'main' into upgrade_act
Edenzzzz b41e65d
Merge branch 'main' into upgrade_act
Fridge003 6bb801f
Merge branch 'main' into upgrade_act
Fridge003 0478776
Merge branch 'main' into upgrade_act
Fridge003 0f14bf6
try device_guard earlier
Edenzzzz f2c4eb5
Merge branch 'main' into upgrade_act
Fridge003 File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.