-
Notifications
You must be signed in to change notification settings - Fork 624
[DP] Add max_num_tokens_across_dp to AscendMetadata to fix dp and update example #1273
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
a79311f to
4b69066
Compare
|
Not sure why CI time out now, try to increase timeout to fix p.s. the same test sucessed in v0.9.1 already |
|
This pull request has conflicts, please resolve those before we can evaluate the pull request. |
Signed-off-by: MengqingCao <[email protected]>
Codecov Report✅ All modified and coverable lines are covered by tests. Additional details and impacted files@@ Coverage Diff @@
## main #1273 +/- ##
==========================================
- Coverage 27.39% 27.37% -0.03%
==========================================
Files 56 56
Lines 6191 6185 -6
==========================================
- Hits 1696 1693 -3
+ Misses 4495 4492 -3
Flags with carried forward coverage won't be shown. Click here to find out more. ☔ View full report in Codecov by Sentry. 🚀 New features to boost your workflow:
|
|
@wangxiyuan plz help merge this and then #1277 , thanks! |
### What this PR does / why we need it? Add `max_num_tokens_across_dp` to AscendMetadata to fix dp This pr fixes the bug introduced by vllm-project#1229, which add an arg `max_num_tokens_across_dp` when dp_size > 1. Signed-off-by: MengqingCao <[email protected]>
### What this PR does / why we need it? Add `max_num_tokens_across_dp` to AscendMetadata to fix dp This pr fixes the bug introduced by vllm-project#1229, which add an arg `max_num_tokens_across_dp` when dp_size > 1. Signed-off-by: MengqingCao <[email protected]>
### What this PR does / why we need it? Add `max_num_tokens_across_dp` to AscendMetadata to fix dp This pr fixes the bug introduced by vllm-project#1229, which add an arg `max_num_tokens_across_dp` when dp_size > 1. Signed-off-by: MengqingCao <[email protected]>
What this PR does / why we need it?
Add
max_num_tokens_across_dpto AscendMetadata to fix dpThis pr fixes the bug introduced by #1229, which add an arg
max_num_tokens_across_dpwhen dp_size > 1.Does this PR introduce any user-facing change?
How was this patch tested?