-
Notifications
You must be signed in to change notification settings - Fork 1k
[Diffusion]: Diffusion Ring Attention support #273
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from all commits
Commits
Show all changes
33 commits
Select commit
Hold shift + click to select a range
0dc25f3
feat: add Ring Attention support for sequence parallelism
mxuax 263c41d
fix: merge Returns docstring into single description
mxuax 7bb7d39
fix: put Returns description on single line for griffe
mxuax d78ef5c
fix: put Returns description on single line for griffe
mxuax c6858e6
chore: sync and trigger CI rebuild
mxuax 04bf625
fix: merge Returns docstring into single description
mxuax e611333
Merge branch 'main' into usp
mxuax e22f68d
refactor: clean up ring attention backend
mxuax 02490dc
Merge branch 'usp' of https://github.com/mxuax/vllm-omni-ring-attn in…
mxuax fe6d67f
vllm-omni-ring-attn\tests\e2e\offline_inference\test_sequence_paralle…
mxuax e235261
remove backward in ring_flash_attn.py and ring_pytorch_attn.py
mxuax e4bdd84
modify test file
mxuax b30971a
fix doc string
mxuax 26d6106
modify test image return type error
mxuax 98dd6b0
modify test image
mxuax d8a09b7
modify test image
mxuax bb29cf7
modify test image
mxuax 70ef57d
Merge branch 'main' into usp
ZJY0516 3f4e265
modify ring_pytorch_attn default backends to be efficient spda
mxuax 727b9af
Merge branch 'usp' of https://github.com/mxuax/vllm-omni-ring-attn in…
mxuax a491391
add debug lines for ci
mxuax c30c755
add debug lines for ci
mxuax b7b0bad
fixed bug test_sp wrongly access output.request_
mxuax aaa41c0
add shm-size: 8gb in pipeline.yml for ring communication requirements
mxuax 86769b3
modify test_comm.py and add it to pipeline.yml to check the p2p commu…
mxuax ae3bd1c
modify the flash call
mxuax d537d16
modify pytorch_attn for continuous tensor passing
mxuax b349eae
finalize test parameer
mxuax c0f1db7
Merge branch 'main' into usp
mxuax 698e7f2
Accelerate Diffusion Parallelism Test
mxuax 6c77f7e
Merge branch 'usp' of https://github.com/mxuax/vllm-omni-ring-attn in…
mxuax 1aea35f
fix time limitation
mxuax ed6182d
Merge branch 'main' into usp
hsliuustc0106 File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.