fix multiconnector for multi connector use push kv connector#31144
fix multiconnector for multi connector use push kv connector#31144liziyu179 wants to merge 1 commit intovllm-project:mainfrom
Conversation
There was a problem hiding this comment.
Code Review
This pull request addresses a bug in the MultiConnector by correctly passing real KV cache blocks to connectors configured as KV producers with PUT_ASYNC or PUT send types. The change ensures that all relevant connectors receive the necessary data for their operations, improving the correctness of KV transfer in multi-connector setups. The logic appears sound and directly resolves the described issue.
|
Hi @liziyu179, the pre-commit checks have failed. Please run: uv pip install pre-commit
pre-commit install
pre-commit run --all-filesThen, commit the changes and push to your branch. For future commits, Tip Is
|
|
👋 Hi! Thank you for contributing to the vLLM project. 💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels. Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run You ask your reviewers to trigger select CI tests on top of Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging. To run CI, PR reviewers can either: Add If you have any questions, please reach out to us on Slack at https://slack.vllm.ai. 🚀 |
Signed-off-by: liziyu <liziyu16@huawei.com>
9d1de9e to
9ce2335
Compare
|
@NickLucche Could you please take a look at the issue addressed by this PR when you have time? |
NickLucche
left a comment
There was a problem hiding this comment.
Thanks for spotting this @liziyu179 !
Personally, I think this level of detail should be abstracted away from the multiconnector, but more generally I am not sure that prioritizing push connectors (with connector-specific format) is the way to go.
I also understand that given the inherent flexibility of the interface, cases like this where some connectors are not "compatible" may arise again.
Perhaps we should consider again some "priority" level of make the inherent one more clear?
@njhill do you have a better idea on how the MultiConnector behavior should be modified to accomodate this case?
cc @sdavidbd who may also be interested in the discussion
|
I agree with @NickLucche that the As I understand it, On the API semantics: I believe Accordingly, if a connector is using For deriving save/send work, the intended mechanism appears to be Concretely, I think the correct fix here is to derive save/send work in That said, I agree the name |
|
I agree with what @sdavidbd wrote, the intention is for @liziyu179 it would help if you could provide an example of a "push" connector implementation of the kind you're talking about here.
@sdavidbd I don't quite understand what you mean by this? |
@njhill I was referring to the |
Purpose
a bugfix for multi connector use push kv connector,pass real blocks to kv connector.
Fixes #31145.
Test Plan
Test Result
Essential Elements of an Effective PR Description Checklist
supported_models.mdandexamplesfor a new model.