-
Notifications
You must be signed in to change notification settings - Fork 1.1k
feat(sglang): add ephemeral KV session routing #7665
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from all commits
Commits
Show all changes
33 commits
Select commit
Hold shift + click to select a range
e94e482
feat: add sticky session control for sglang sessions
ishandhanani f7bb91d
refactor(sglang): make agent controller session-only
ishandhanani 429a47f
fix(router): registration and session control client fixes
ishandhanani f3e4aaa
fix: address review comments on session control PR
ishandhanani 834fb44
docs: update agent docs for session control
ishandhanani d4f13c2
feat: add agg_agent.sh launch script and quickstart docs
ishandhanani 291d4a4
docs: fix OpenCode launch command
ishandhanani e0173a3
docs: fix OpenCode fork URL
ishandhanani c96a519
docs: fix OpenCode fork URL to anomalyco/opencode
ishandhanani 83cbff4
revert: restore agg_router.sh to match main
ishandhanani 54d70a6
docs: remove python example and priority eviction section from agent …
ishandhanani 68ab940
docs: expand OpenCode quickstart with provider config and subagent li…
ishandhanani 0c88d13
docs: reference SGLang streaming session PR requirement
ishandhanani 4291cce
refactor: simplify session control, remove dead cache_control code
ishandhanani 59d1185
lint
ishandhanani 665670e
lint
ishandhanani 96caee8
fix: session close leak on drop and affinity bind on failed open
ishandhanani 19ed3f5
go
ishandhanani aa3a6fd
fix: gate session_control endpoint on --enable-streaming-session
ishandhanani fd26501
Merge branch 'main' into idhanani/dyn-ephemeral-kv-sessions
ishandhanani 7ed03b3
fix: use getattr for enable_streaming_session (upstream compat)
ishandhanani cd1da9f
Merge branch 'main' into idhanani/dyn-ephemeral-kv-sessions
ishandhanani 55c9a94
fix(kv-router): correct drop ordering for deferred session close
ishandhanani 31175a7
fix(kv-router): skip sticky resolution for non-session requests
ishandhanani 0dddad4
Merge branch 'main' into idhanani/dyn-ephemeral-kv-sessions
ishandhanani 11f9a2d
refactor(kv-router): make session control request-driven instead of s…
ishandhanani 4d9ac52
fix(kv-router): graceful degradation when streaming sessions unavailable
ishandhanani 77fbf83
Merge remote-tracking branch 'origin/main' into idhanani/dyn-ephemera…
ishandhanani ff8636d
test(sglang): improve streaming session smoke test
ishandhanani facdd23
go
ishandhanani 8cb46c1
fix(clippy): collapse nested if in RequestGuard drop
ishandhanani a340913
style: cargo fmt
ishandhanani 647c110
docs: note SGLang version requirement for streaming sessions
ishandhanani File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.