Support outlines > 0.0.31 by comaniac · Pull Request #219 · sgl-project/sglang

comaniac · 2024-02-22T19:22:22Z

close #203

This PR adds support for outlines > 0.0.31 so that we can relax the pinned upper version bound.

The implementation is based on the changes between outlines 0.0.31 and 0.0.32 (dottxt-ai/outlines@0.0.31...0.0.32).
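To illustrate what relaxing a pinned upper version means in practice, here is a minimal sketch. It is not taken from this PR: the helper names `parse_version` and `is_allowed` are hypothetical, the bounds are illustrative only, and it handles only plain dotted numeric versions.

```python
from typing import Optional, Tuple


def parse_version(text: str) -> Tuple[int, ...]:
    """Turn a plain dotted version such as "0.0.31" into a comparable tuple."""
    return tuple(int(part) for part in text.split("."))


def is_allowed(
    version: str,
    lower: Optional[str] = None,
    upper: Optional[str] = None,
) -> bool:
    """Check a version against optional inclusive lower and upper bounds."""
    v = parse_version(version)
    if lower is not None and v < parse_version(lower):
        return False
    if upper is not None and v > parse_version(upper):
        return False
    return True


# Hypothetical bounds, for illustration only.
print(is_allowed("0.0.32", upper="0.0.31"))  # a pinned upper bound rejects 0.0.32
print(is_allowed("0.0.32"))                  # with the bound removed, 0.0.32 passes
```

A real project would normally express this constraint in its package requirements rather than in code; the sketch only shows the effect of dropping the upper bound.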

cc @hnyls2002

hnyls2002 · 2024-02-24T07:05:13Z

LGTM


* Move and adaptation of GDN opt from Qwen3next to Qwen3.5 (sgl-project#215)
* rm 3*copy after casual conv, do q/k/v split in causal_conv1d_update_split_qkv kernel during decode time
* Add fused Sigmoid and mul triton kernel
* add fused gdn gating and sigmoid func for prefill phase
* Add fused causal conv1d fwd split qkv kernel
* Add fused cumsum_kkt, solve_tril, and merge_recompute kernels for prefill optimization
* Increase parallel params for chunk_gated_delta_rule_fwd_kernel
* add chunk o kernel opt
* Add sigmoid and mul for shared expert module

---------

Co-authored-by: yixionghuo <66975556+yixionghuo@users.noreply.github.com>

* [Fix] Load weights error in compressed-tensors
* Fuse 4 GDN input projections (qkv/z/b/a) into single GEMM (sgl-project#219)
* Revert "Fuse 4 GDN input projections (qkv/z/b/a) into single GEMM (sgl-project#219)" (sgl-project#223) This reverts commit 17b0749.
* Fuse 4 GDN input projections (qkv/z/b/a) into single GEMM (sgl-project#225)
* [Fix] Core dump issue in stress test

---------

Co-authored-by: zachYao <zayao@amd.com>
Co-authored-by: root <root@hjbog-srdc-29.amd.com>

Support outlines > 0.0.31

a4d5863

hnyls2002 merged commit 3c2c586 into main Feb 24, 2024

hnyls2002 deleted the cody/upgrade_outlines branch February 24, 2024 07:06

timethink pushed a commit to timethink/sglang that referenced this pull request Mar 9, 2025

Support outlines > 0.0.31 (sgl-project#219)

aad731e

vschandramourya pushed a commit to vschandramourya/sglang that referenced this pull request Feb 3, 2026

Copy OSS code from a0835c3 on 20251011 (sgl-project#219)

16cfb9f

Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>

IzacharyI added a commit to IzacharyI/sglang that referenced this pull request Mar 17, 2026

Fuse 4 GDN input projections (qkv/z/b/a) into single GEMM (sgl-project#219)

17b0749

IzacharyI pushed a commit to IzacharyI/sglang that referenced this pull request Mar 18, 2026

Revert "Fuse 4 GDN input projections (qkv/z/b/a) into single GEMM (sgl-project#219)" (sgl-project#223)

2722ef4

This reverts commit 17b0749.
