Skip to content

Conversation

@merrymercy
Copy link
Contributor

No description provided.

@merrymercy merrymercy merged commit 70359bf into main Jan 16, 2024
@merrymercy merrymercy deleted the benchmark branch January 16, 2024 00:13
timethink pushed a commit to timethink/sglang that referenced this pull request Mar 9, 2025
chunyuan-w added a commit to chunyuan-w/sglang that referenced this pull request Mar 12, 2025
* Use fused_experts_cpu and add weight packing

* add check on whether AMX is supported

* move utils to cpu_utils.py

* address comment

* no need to pass in is_vnni since it's True by default; change inplace to True

* refactor prepack_weight_if_needed

* Only import sgl_kernel.cpu once
chunyuan-w added a commit to chunyuan-w/sglang that referenced this pull request Mar 14, 2025
* Use fused_experts_cpu and add weight packing

* add check on whether AMX is supported

* move utils to cpu_utils.py

* address comment

* no need to pass in is_vnni since it's True by default; change inplace to True

* refactor prepack_weight_if_needed

* Only import sgl_kernel.cpu once
chunyuan-w added a commit to chunyuan-w/sglang that referenced this pull request Mar 14, 2025
* Use fused_experts_cpu and add weight packing

* add check on whether AMX is supported

* move utils to cpu_utils.py

* address comment

* no need to pass in is_vnni since it's True by default; change inplace to True

* refactor prepack_weight_if_needed

* Only import sgl_kernel.cpu once
chunyuan-w added a commit to chunyuan-w/sglang that referenced this pull request Mar 14, 2025
* Use fused_experts_cpu and add weight packing

* add check on whether AMX is supported

* move utils to cpu_utils.py

* address comment

* no need to pass in is_vnni since it's True by default; change inplace to True

* refactor prepack_weight_if_needed

* Only import sgl_kernel.cpu once
ch-wan pushed a commit to ch-wan/sglang that referenced this pull request Apr 25, 2025
chunyuan-w added a commit to chunyuan-w/sglang that referenced this pull request May 28, 2025
* Use fused_experts_cpu and add weight packing

* add check on whether AMX is supported

* move utils to cpu_utils.py

* address comment

* no need to pass in is_vnni since it's True by default; change inplace to True

* refactor prepack_weight_if_needed

* Only import sgl_kernel.cpu once
chunyuan-w added a commit to chunyuan-w/sglang that referenced this pull request May 28, 2025
* Use fused_experts_cpu and add weight packing

* add check on whether AMX is supported

* move utils to cpu_utils.py

* address comment

* no need to pass in is_vnni since it's True by default; change inplace to True

* refactor prepack_weight_if_needed

* Only import sgl_kernel.cpu once
chunyuan-w added a commit to chunyuan-w/sglang that referenced this pull request Jun 3, 2025
* Use fused_experts_cpu and add weight packing

* add check on whether AMX is supported

* move utils to cpu_utils.py

* address comment

* no need to pass in is_vnni since it's True by default; change inplace to True

* refactor prepack_weight_if_needed

* Only import sgl_kernel.cpu once
chunyuan-w added a commit to chunyuan-w/sglang that referenced this pull request Jun 6, 2025
* Use fused_experts_cpu and add weight packing

* add check on whether AMX is supported

* move utils to cpu_utils.py

* address comment

* no need to pass in is_vnni since it's True by default; change inplace to True

* refactor prepack_weight_if_needed

* Only import sgl_kernel.cpu once
pengxin99 pushed a commit to pengxin99/sglang that referenced this pull request Jun 19, 2025
sleepcoo pushed a commit to shuaills/sglang that referenced this pull request Jun 24, 2025
pi314ever pushed a commit to pi314ever/sglang that referenced this pull request Jul 10, 2025
siuhunh pushed a commit to xing-wenjin/sglang that referenced this pull request Jul 21, 2025
yichiche pushed a commit to yichiche/sglang that referenced this pull request Jul 30, 2025
* align shapes

Signed-off-by: Ivan Butygin <[email protected]>

* fix

Signed-off-by: Ivan Butygin <[email protected]>

---------

Signed-off-by: Ivan Butygin <[email protected]>
yichiche pushed a commit to yichiche/sglang that referenced this pull request Aug 7, 2025
* align shapes

Signed-off-by: Ivan Butygin <[email protected]>

* fix

Signed-off-by: Ivan Butygin <[email protected]>

---------

Signed-off-by: Ivan Butygin <[email protected]>
yichiche pushed a commit to yichiche/sglang that referenced this pull request Aug 11, 2025
* align shapes

Signed-off-by: Ivan Butygin <[email protected]>

* fix

Signed-off-by: Ivan Butygin <[email protected]>

---------

Signed-off-by: Ivan Butygin <[email protected]>
Xia-Weiwen pushed a commit to Xia-Weiwen/sglang that referenced this pull request Sep 5, 2025
* set a higher timeout threshold to prevent forced terminated

* disable rope kernel to address the accuracy regression in llama
kalyank007 pushed a commit to kalyank007/sglang that referenced this pull request Nov 7, 2025
amd-youchen referenced this pull request in amd-youchen/sglang Nov 13, 2025
 add pd disaggregation best practices
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants