[V0 Deprecation] Drop V0 encoder-decoder runner by WoosukKwon · Pull Request #23300 · vllm-project/vllm

WoosukKwon · 2025-08-21T02:00:24Z

Summary

remove encoder-decoder model runner and utils
delete encoder-decoder examples and tests
drop encoder-decoder models from registry and docs

Testing

pre-commit run --files docs/models/supported_models.md tests/core/block/test_block_manager.py tests/models/registry.py tests/models/test_registry.py tests/test_config.py tests/test_inputs.py vllm/model_executor/models/registry.py vllm/worker/worker.py (failed: command not found)
pip install pre-commit (failed: Tunnel connection failed: 403 Forbidden)
pytest tests/test_inputs.py::test_parse_single_batch_empty -q (failed: ModuleNotFoundError: No module named 'torch')

https://chatgpt.com/codex/tasks/task_b_68a521482e3c832db72d14979dc1ff68

gemini-code-assist

Code Review

This pull request effectively removes the generic encoder-decoder model runner and its associated utilities, tests, and documentation. The changes are consistent with the goal of dropping this functionality. My review focuses on ensuring the removal is clean and complete. I've identified one area where there might be leftover code that could be removed to improve maintainability.

gemini-code-assist · 2025-08-21T02:01:50Z

With the removal of the generic encoder-decoder runner, the is_encoder_decoder property on ModelConfig appears to be obsolete. Its primary user in vllm/worker/worker.py has been removed.

This test now only checks that two decoder-only models are not encoder-decoder models, which provides limited value.

If ModelConfig.is_encoder_decoder is no longer used throughout the codebase, it should be removed from ModelConfig to avoid confusion and dead code. Consequently, this test should also be removed. If the property is still used by specialized models (e.g., Whisper), this test should be updated to include positive test cases for those models to ensure its functionality is still correctly verified.

github-actions · 2025-08-21T02:17:21Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either: Add ready label to the PR or enable auto-merge.

🚀

Signed-off-by: Woosuk Kwon <woosuk@thinkingmachines.ai>

DarkLight1337 · 2025-08-21T03:04:16Z

This is one of the last use cases which V1 still doesn't support, so I think we should drop this last

WoosukKwon · 2025-08-21T04:34:36Z

@DarkLight1337 We don't plan to continue support these models.

DarkLight1337 · 2025-08-21T04:38:47Z

#21088 should enable it for Whisper at least, it is getting close to being ready

WoosukKwon · 2025-08-21T04:40:09Z

@DarkLight1337 Oh right. Whisper is the only exception. I can wait until the PR is merged.

russellb · 2025-08-21T17:21:03Z

@DarkLight1337 Oh right. Whisper is the only exception. I can wait until the PR is merged.

bart will be a pretty small addition on top, i think. Maybe wait to see how it looks? I was going to look at it next.

hmellor · 2025-08-25T12:10:54Z

With more progress on the Transformers backend side we may be able to run the encoder in Transformers (Transformers can handle any encoder caching) and then the decoder in vLLM (via Transformers backend) where we pass the embeddings from the encoder directly into the decoder in vLLM.

I can't say for certain exactly how this will work as there's a massive ongoing refactor to allow us to swap out the attention module for the vLLM attention module in encoder-decoder models. Unfortunately I don't have a timeline for when this will be ready.

mergify · 2025-08-26T02:44:08Z

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @WoosukKwon.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

WoosukKwon · 2025-08-27T00:52:50Z

@russellb please don't do it. We will not support any encoder decoder model besides Whisper. Whisper is the ONLY exception.

russellb · 2025-08-27T14:58:19Z

@russellb please don't do it. We will not support any encoder decoder model besides Whisper. Whisper is the ONLY exception.

Got it - was just looking to close gaps for things I know people are using (unfortunately)

NickLucche

I think @russellb 's PR is still relying on a bunch of these tests to ensure correctness.
Can we factor them out to make sure the eventual whisper addition still has reasonable test coverage?

russellb · 2025-09-15T23:33:42Z

@WoosukKwon would you like me to pick this up?

hmellor · 2025-09-16T06:51:19Z

Looks like this work was moved to a new PR (already merged) #24907

WoosukKwon requested review from DarkLight1337, hmellor, tlrmchlsmth, yewentao256 and ywang96 as code owners August 21, 2025 02:00

WoosukKwon added the codex label Aug 21, 2025 — with ChatGPT Codex Connector

WoosukKwon requested review from alexm-redhat, comaniac, njhill, youkaichao and zhuohan123 as code owners August 21, 2025 02:00

mergify Bot added documentation Improvements or additions to documentation llama Related to Llama models multi-modality Related to multi-modality (#4194) new-model Requests to new models labels Aug 21, 2025

gemini-code-assist Bot reviewed Aug 21, 2025

View reviewed changes

WoosukKwon changed the title ~~chore: drop encoder-decoder runner~~ [Chore] Drop V0 encoder-decoder runner Aug 21, 2025

ywang96 approved these changes Aug 21, 2025

View reviewed changes

WoosukKwon changed the title ~~[Chore] Drop V0 encoder-decoder runner~~ [V0 Deprecation] Drop V0 encoder-decoder runner Aug 21, 2025

remove

8b7946d

Signed-off-by: Woosuk Kwon <woosuk@thinkingmachines.ai>

WoosukKwon force-pushed the codex/remove-v0-encoder-decoder-model branch from cef656e to 8b7946d Compare August 21, 2025 02:55

hmellor added this to V0 Deprecation Aug 25, 2025

hmellor moved this to In Progress in V0 Deprecation Aug 25, 2025

mergify Bot added the needs-rebase label Aug 26, 2025

russellb mentioned this pull request Aug 28, 2025

[v1] Add Whisper model support (encoder-decoder) #21088

Merged

3 tasks

NickLucche requested changes Aug 29, 2025

View reviewed changes

WoosukKwon closed this Sep 15, 2025

github-project-automation Bot moved this from In Progress to Done in V0 Deprecation Sep 15, 2025

hmellor deleted the codex/remove-v0-encoder-decoder-model branch September 15, 2025 20:18

DarkLight1337 mentioned this pull request Sep 17, 2025

[Misc] Add removed encoder-decoder models to previously supported models list #24961

Merged

5 tasks

Uh oh!

Conversation

WoosukKwon commented Aug 21, 2025

Summary

Testing

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot Aug 21, 2025

Choose a reason for hiding this comment

Uh oh!

github-actions Bot commented Aug 21, 2025

Uh oh!

DarkLight1337 commented Aug 21, 2025

Uh oh!

WoosukKwon commented Aug 21, 2025

Uh oh!

DarkLight1337 commented Aug 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

WoosukKwon commented Aug 21, 2025

Uh oh!

russellb commented Aug 21, 2025

Uh oh!

hmellor commented Aug 25, 2025

Uh oh!

mergify Bot commented Aug 26, 2025

Uh oh!

WoosukKwon commented Aug 27, 2025

Uh oh!

russellb commented Aug 27, 2025

Uh oh!

NickLucche left a comment

Choose a reason for hiding this comment

Uh oh!

russellb commented Sep 15, 2025

Uh oh!

hmellor commented Sep 16, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

DarkLight1337 commented Aug 21, 2025 •

edited

Loading