Remove virtual engine handling by WoosukKwon · Pull Request #30350 · vllm-project/vllm

WoosukKwon · 2025-12-09T16:34:18Z

Summary

Remove the virtual engine field from the forward context helpers.
Simplify KV cache access across attention layers and KV connectors to use a single cache instance.
Update KV connector tests to align with the new forward context interface.
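
A rough sketch of the access-pattern change described above. The classes here (`OldForwardContext`, `NewForwardContext`, `OldLayer`, `NewLayer`) are hypothetical stand-ins written for illustration, not vLLM's actual types. Only the idea of replacing a list indexed by `virtual_engine` with a single cache instance is taken from this PR.

```python
from dataclasses import dataclass, field
from typing import Any


@dataclass
class OldForwardContext:
    # Hypothetical stand-in: the context carries an index selecting a cache.
    virtual_engine: int = 0


@dataclass
class NewForwardContext:
    # Hypothetical stand-in: the virtual engine field is gone.
    pass


@dataclass
class OldLayer:
    # Before: one KV cache per virtual engine, stored as a list.
    kv_cache: list[Any] = field(default_factory=list)

    def get_cache(self, ctx: OldForwardContext) -> Any:
        return self.kv_cache[ctx.virtual_engine]


@dataclass
class NewLayer:
    # After: a single KV cache instance, no indexing needed.
    kv_cache: Any = None

    def get_cache(self, ctx: NewForwardContext) -> Any:
        return self.kv_cache


old = OldLayer(kv_cache=["cache-0"])
new = NewLayer(kv_cache="cache-0")
assert old.get_cache(OldForwardContext()) == new.get_cache(NewForwardContext())
```

With only one virtual engine in use, both layers return the same cache, which is why the index can be dropped without changing behavior in this toy setup.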

Testing

python -m pytest tests/v1/kv_connector/unit/test_lmcache_integration.py::test_forward_context_interface (fails: ModuleNotFoundError: tblib)
python -m compileall tests/v1/kv_connector/unit/test_nixl_connector.py
python -m compileall tests/v1/kv_connector/unit/test_offloading_connector.py tests/v1/kv_connector/unit/test_decode_bench_connector.py tests/v1/kv_connector/unit/test_lmcache_integration.py

chatgpt-codex-connector · 2025-12-09T16:34:36Z

Codex usage limits have been reached for code reviews. Please check with the admins of this repo to increase the limits by adding credits.

mergify · 2025-12-09T16:34:59Z

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @WoosukKwon.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

gemini-code-assist

Code Review

This pull request removes the virtual_engine field and its handling throughout the codebase. The changes simplify KV cache access by assuming a single cache instance instead of a list indexed by virtual_engine. The modifications are consistently applied across various components, including attention layers, KV connectors, model implementations, and tests. The refactoring is clean and aligns with the goal of simplifying the KV cache management. I have reviewed the changes and found no issues.

XingLiu1 · 2026-02-25T06:59:50Z

Hi maintainers, quick question about scope:

Does this PR also cover the TODO in
https://github.com/vllm-project/vllm/blob/main/vllm/v1/worker/gpu/kv_connector.py#L72
("sort out KV Connectors' use of forward_context", introduced in #32742)?

Right now pre_forward() falls back to set_forward_context(None, ...) before
start_load_kv(get_forward_context()) when no forward context is active.
Should that behavior be considered resolved by this PR, or should it be tracked as a separate follow-up issue?
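
For context, the fallback being asked about could be sketched roughly as follows. This is a toy re-implementation under stated assumptions: `set_forward_context`, `get_forward_context`, and `pre_forward` below are simplified stand-ins that only borrow the names used in the question, `has_forward_context` is a hypothetical helper, and the real signatures in vLLM take more arguments.

```python
from contextlib import contextmanager, nullcontext
from typing import Any, Callable, Iterator, Optional

# Hypothetical module-level state holding the active forward context.
_current_ctx: Optional[dict] = None


def has_forward_context() -> bool:
    # Hypothetical helper: reports whether a context is currently active.
    return _current_ctx is not None


def get_forward_context() -> dict:
    if _current_ctx is None:
        raise RuntimeError("no forward context is active")
    return _current_ctx


@contextmanager
def set_forward_context(attn_metadata: Any) -> Iterator[None]:
    # Simplified stand-in: installs a context and restores the previous one.
    global _current_ctx
    prev = _current_ctx
    _current_ctx = {"attn_metadata": attn_metadata}
    try:
        yield
    finally:
        _current_ctx = prev


def pre_forward(start_load_kv: Callable[[dict], None]) -> None:
    # If no context is active, fall back to a placeholder context built
    # from None, so that get_forward_context() does not fail.
    cm = nullcontext() if has_forward_context() else set_forward_context(None)
    with cm:
        start_load_kv(get_forward_context())
```

In this sketch the placeholder context exists only for the duration of `start_load_kv`. An already-active context is passed through unchanged.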

Remove virtual engine handling

ebe0733

WoosukKwon requested review from ApostaC, LucasWilkinson, NickLucche, sighingnow and tdoublep as code owners December 9, 2025 16:34

WoosukKwon added the codex label Dec 9, 2025 — with ChatGPT Codex Connector

mergify bot added the qwen, v1, and tpu labels Dec 9, 2025

mergify bot added the needs-rebase and kv-connector labels Dec 9, 2025

gemini-code-assist bot reviewed Dec 9, 2025

robertgshaw2-redhat removed the codex label Dec 20, 2025

WoosukKwon closed this Feb 26, 2026

WoosukKwon deleted the codex/remove-virtual-engine-from-codebase branch February 26, 2026 08:52

njhill mentioned this pull request Mar 16, 2026

[V0 Deprecation] Deprecate virtual engine #37195

Merged

WoosukKwon wants to merge 1 commit into main from codex/remove-virtual-engine-from-codebase

3 participants
