Support outlines > 0.0.31 #219

Merged
hnyls2002 merged 1 commit into main from cody/upgrade_outlines
Feb 24, 2024
Conversation

@comaniac
Contributor

close #203

This PR adds support for outlines > 0.0.31 so that we can relax the pinned upper version bound.

The implementation is based on dottxt-ai/outlines@0.0.31...0.0.32

cc @hnyls2002
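For readers hitting the linked `ImportError`, one common way to tolerate a symbol that moved or was renamed between library releases is to try several candidate names and use the first one that exists. The sketch below is a generic illustration, not the code merged in this PR: the helper `import_first_available` is hypothetical, and the replacement name `build_regex_from_schema` in the commented usage is an assumption about newer outlines releases that should be checked against the upstream diff.

```python
import importlib
from typing import Any, Sequence


def import_first_available(module_name: str, candidates: Sequence[str]) -> Any:
    """Return the first attribute in `candidates` that `module_name` exports.

    Hypothetical helper for illustration; raises ImportError if none exist.
    """
    module = importlib.import_module(module_name)
    for name in candidates:
        if hasattr(module, name):
            return getattr(module, name)
    raise ImportError(
        f"none of {list(candidates)} could be imported from {module_name!r}"
    )


# Hypothetical usage (requires outlines to be installed; the second name is an
# assumption about what newer releases export, not something this PR states):
# build_regex = import_first_available(
#     "outlines.fsm.json_schema",
#     ["build_regex_from_object", "build_regex_from_schema"],
# )
```

A fallback like this only resolves the name. If the old and new functions accept different argument types, the caller still needs a thin wrapper to convert inputs before calling whichever one was found.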

@hnyls2002
Collaborator

LGTM

@hnyls2002 hnyls2002 merged commit 3c2c586 into main Feb 24, 2024
@hnyls2002 hnyls2002 deleted the cody/upgrade_outlines branch February 24, 2024 07:06
timethink pushed a commit to timethink/sglang that referenced this pull request Mar 9, 2025
vschandramourya pushed a commit to vschandramourya/sglang that referenced this pull request Feb 3, 2026
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
IzacharyI added a commit to IzacharyI/sglang that referenced this pull request Mar 17, 2026
IzacharyI pushed a commit to IzacharyI/sglang that referenced this pull request Mar 18, 2026
IzacharyI added a commit to IzacharyI/sglang that referenced this pull request Mar 31, 2026
* Move and adaptation of GDN opt from Qwen3next to Qwen3.5 (sgl-project#215)

* rm 3*copy after causal conv, do q/k/v split in causal_conv1d_update_split_qkv kernel during decode time

* Add fused Sigmoid and mul triton kernel

* Add fused GDN gating and sigmoid func for prefill phase

* Add fused causal conv1d fwd split qkv kernel

* Add fused cumsum_kkt, solve_tril, and merge_recompute kernels for prefill optimization

* Increase parallel params for chunk_gated_delta_rule_fwd_kernel

* add chunk o kernel opt

* Add sigmoid and mul for shared expert module

---------

Co-authored-by: yixionghuo <66975556+yixionghuo@users.noreply.github.com>

* [Fix] Load weights error in compressed-tensors

* Fuse 4 GDN input projections (qkv/z/b/a) into single GEMM (sgl-project#219)

* Revert "Fuse 4 GDN input projections (qkv/z/b/a) into single GEMM (sgl-project#219)" (sgl-project#223)

This reverts commit 17b0749.

* Fuse 4 GDN input projections (qkv/z/b/a) into single GEMM (sgl-project#225)

* [Fix] Core dump issue in stress test

---------

Co-authored-by: zachYao <zayao@amd.com>
Co-authored-by: root <root@hjbog-srdc-29.amd.com>
Development

Successfully merging this pull request may close these issues.

ImportError: cannot import name 'build_regex_from_object' from 'outlines.fsm.json_schema'

2 participants