Conversation
|
👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:
If CI fails, you can run linting and testing checks locally according Contributing and Testing. |
8225c3b to
ce79104
Compare
|
#1857 I submitted a pull request with this feature, and eliminated the communication overhead for DP metadata when constructing the forward context. Now, I'm waiting for @wangxiyuan to refactor the model runner. |
ok, I abandon modifying this function _get_forward_metadata_across_dp, what time @wangxiyuan to refactor the model runner? I modifying this funciton has_unfinished_dp first |
e121e1d to
2ae3461
Compare
Codecov Report✅ All modified and coverable lines are covered by tests. Additional details and impacted files@@ Coverage Diff @@
## main #2102 +/- ##
==========================================
+ Coverage 75.74% 75.85% +0.10%
==========================================
Files 118 119 +1
Lines 13525 13585 +60
==========================================
+ Hits 10245 10305 +60
Misses 3280 3280
Flags with carried forward coverage won't be shown. Click here to find out more. ☔ View full report in Codecov by Sentry. 🚀 New features to boost your workflow:
|
|
As far as I know, |
2ae3461 to
cbe946e
Compare
|
This pull request has conflicts, please resolve those before we can evaluate the pull request. |
cbe946e to
c6c56ff
Compare
e93bbda to
245fead
Compare
4289a31 to
079fe9c
Compare
|
please take a look at this one. vllm-project/vllm#22243 |
|
This pull request has conflicts, please resolve those before we can evaluate the pull request. |
Signed-off-by: z00811365 <zhaowei6@huawei.com>
079fe9c to
be22296
Compare
|
This pull request has conflicts, please resolve those before we can evaluate the pull request. |

What this PR does / why we need it?
Implements HCCL for DP has_unfinished_dp communication, resulting in significant performance improvements for large DP

Achieves ~10ms latency reduction with A3 DP size of 128
Does this PR introduce any user-facing change?
How was this patch tested?