-
Notifications
You must be signed in to change notification settings - Fork 6.4k
[Disagg][NIXL] Add staging buffer support for heterogeneous TP KV transfer #22536
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
ShangmingCai
merged 25 commits into
sgl-project:main
from
YAMY1234:feat/nixl-staging-buffer-independent
May 13, 2026
Merged
Changes from all commits
Commits
Show all changes
25 commits
Select commit
Hold shift + click to select a range
e1570fa
[Disagg][NIXL] Add staging buffer support for heterogeneous TP KV tra…
YAMY1234 04f3af9
Fix NIXL staging buffer: use cuMemCreate via custom_mem_pool
YAMY1234 897a402
Refactor NIXL staging buffer: reduce code intrusion, align with moonc…
YAMY1234 ebc4288
Decouple NIXL staging from mooncake: use register_fn callback pattern
YAMY1234 44d3c7c
Restore original type: ignore, getattr, and is_dummy() conventions
YAMY1234 7b25518
Unify staging handler: fix is_dummy bug, generalize PrefillStagingStr…
YAMY1234 615b513
Remove section divider comments added by refactor
YAMY1234 ea9fe3d
Rename decode_info -> dst_info for consistency
YAMY1234 4676ff7
Extract _send_kv_for_req to reduce add_transfer_request complexity
YAMY1234 8065a95
Extract staging notification handlers from update_transfer_status
YAMY1234 3f3fb05
Consolidate staging functions and extract handle_chunk_arrived to com…
YAMY1234 466ebad
Revert _send_kv_for_req extraction, keep inline dispatch in add_trans…
YAMY1234 dbb47c4
Revert unnecessary changes in staging_handler.py
YAMY1234 0a04f45
Restore original field access style in add_transfer_request
YAMY1234 19d7f2d
Move handle_watermark_msg and handle_staging_rsp to common staging_ha…
YAMY1234 d72e9e6
style: format staging buffer code with black
YAMY1234 8aedb1b
fix: limit notification tag split to handle underscores in agent_name
YAMY1234 c9c0d1c
Merge branch 'main' into feat/nixl-staging-buffer-independent
YAMY1234 2e4bc12
refactor(nixl staging): address review feedback
YAMY1234 73cb3bb
Merge upstream main into feat/nixl-staging-buffer-async
YAMY1234 0ebc318
nixl staging: per-worker buffer + dispatch in transfer_worker (moonca…
YAMY1234 c2da8d0
nixl staging: fix prefetch_staging_reqs is_dummy access for NIXL backend
YAMY1234 8c32315
nixl staging: address review feedback (imports / docs / per-room clea…
YAMY1234 bee6bfd
nixl staging: shorten review-feedback comments
YAMY1234 bcb6a69
Merge branch 'main' into feat/nixl-staging-buffer-independent
ShangmingCai File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The Mooncake part LGTM