Fix two issues related to --moe-dense-tp-size=1
#5657
Merged
43 commits
3d2d2e9 disable vocab parallel head (ch-wan)
d6934d0 llama4 support (ch-wan)
8b322c6 Merge remote-tracking branch 'upstream/HEAD' into dev/dp-head (ch-wan)
2e2332a use attn tp group for lm head (ch-wan)
24bcd75 fix (ch-wan)
14ed913 pass accuracy test (ch-wan)
6b43aa5 format (ch-wan)
c9dde02 use local attn dp size (ch-wan)
d0a9b99 fix (ch-wan)
515f20f several fix (ch-wan)
462f51e Update .gitignore (ch-wan)
5adc5e5 fix refactor (ch-wan)
9769217 optimize memory (ch-wan)
3b6b6d7 add debug info (ch-wan)
16c4b74 format (ch-wan)
f0674f7 format (ch-wan)
bad6e91 Merge branch 'main' into dev/fix-dp-ffn-cuda-graph (ch-wan)
62e05aa Merge branch 'main' into dev/dp-head (liusy58)
27a8ec3 Merge branch 'main' into dev/fix-dp-ffn-cuda-graph (liusy58)
182aa52 Add `use_attn_tp_group` for user to decide whether to use vocabulary … (liusy58)
4712ed0 Add `use_attn_tp_group` for user to decide whether to use vocabulary … (liusy58)
c747204 Merge branch 'main' into dev/dp-head (ch-wan)
804311d Merge branch 'main' into dev/fix-dp-ffn-cuda-graph (ch-wan)
5e8e44e Rename `use_attn_tp_group` to `enable_dp_lm_head` and refactor the `_… (liusy58)
027290c Merge branch 'main' into dev/fix-dp-ffn-cuda-graph (liusy58)
8c6ec17 Rename `use_attn_tp_group` to `enable_dp_lm_head` and refactor the `_… (liusy58)
71f13f6 Merge branch 'main' into dev/dp-head (ch-wan)
f7e990f Merge branch 'main' into dev/dp-head (liusy58)
a8e3315 Merge branch 'main' into dev/fix-dp-ffn-cuda-graph (liusy58)
e75d496 Merge branch 'main' into dev/dp-head (liusy58)
efea846 Gather is needed if `enable_dp_lm_head` is not set. (liusy58)
f84c245 Update scheduler.py (ch-wan)
0f43319 Merge branch 'main' into dev/dp-head (ch-wan)
e1500ff Merge branch 'main' into dev/fix-dp-ffn-cuda-graph (ch-wan)
71c12f6 update code style (ch-wan)
160517b format (ch-wan)
5d02170 fix (ch-wan)
8f1395a Merge branch 'dev/dp-head' into dev/fix-dp-ffn-cuda-graph (liusy58)
bb61b5c Merge branch 'main' into dev/fix-dp-ffn-cuda-graph (ch-wan)
bf10e71 Update logits_processor.py (ch-wan)
54b9e5b Merge branch 'main' into dev/fix-dp-ffn-cuda-graph
6d006c0 Merge branch 'main' into dev/fix-dp-ffn-cuda-graph (ch-wan)
25c838f rename `dp_rank` to `attn_dp_rank`
duplicated