[Bugfix] Fix Fish Speech startup on vLLM 0.18.0 - missing architectures by linyueqian · Pull Request #2166 · vllm-project/vllm-omni

linyueqian · 2026-03-25T03:58:18Z

Summary

Fix Fish Speech (and other models without architectures in config.json) failing to start on vLLM 0.18.0.

vLLM 0.18.0 now requires architectures in the model config. Fish Speech s2-pro's config.json doesn't have this field, but the stage config already specifies model_arch. This fix auto-injects model_arch into hf_overrides before ModelConfig creation.
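The injection described above can be sketched roughly as follows. This is a minimal illustration, not the code in this PR: `inject_model_arch` is a hypothetical helper, and it assumes `hf_overrides` is a plain dict and that an explicit `architectures` entry supplied by the user should win over the stage config's `model_arch`.

```python
from typing import Any, Dict, Optional


def inject_model_arch(
    hf_overrides: Optional[Dict[str, Any]],
    model_arch: Optional[str],
) -> Dict[str, Any]:
    """Return a copy of hf_overrides that carries `architectures`.

    Hypothetical helper: the name and the precedence rule are assumptions
    made for illustration only.
    """
    merged = dict(hf_overrides or {})  # copy so the caller's dict is not mutated
    if model_arch and "architectures" not in merged:
        # HF configs store architectures as a list of class names.
        merged["architectures"] = [model_arch]
    return merged


overrides = inject_model_arch(None, "FishSpeechSlowARForConditionalGeneration")
```

The merged dict would then be passed along before the base ModelConfig is built, so the architecture is already present when it is first read.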

Test plan

# Before: crashes with "No model architectures are specified"
# After: starts successfully
python examples/offline_inference/fish_speech/end2end.py \
    --model fishaudio/s2-pro --text "Hello test"

Tested with s2-pro on A100, offline inference produces valid 1.02s audio @ 44.1kHz.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 0fe68c8da6

chatgpt-codex-connector · 2026-03-25T04:02:14Z

+            from vllm.transformers_utils.config import get_config
+
+            try:
+                hf_cfg = get_config(self.model, trust_remote_code=self.trust_remote_code)


Avoid loading Hugging Face config twice during startup

This new pre-check calls get_config(...) before super().create_model_config(), and EngineArgs.create_model_config() will load the HF config again when constructing the vLLM ModelConfig. For stages that set model_arch and have no hf_overrides, this adds an extra remote/local config load per stage, which increases startup time and can introduce avoidable transient fetch failures for multi-stage pipelines. Prefer reusing a single loaded config or injecting/merging architectures without a second fetch path.
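One way to avoid the second fetch, sketched under assumptions: if the override can be a callable that receives the already-loaded HF config (assumed here, not confirmed for this code path), the "is `architectures` missing?" check can be deferred to that point instead of calling get_config(...) up front. `ensure_architectures` is a hypothetical name, and `SimpleNamespace` stands in for the real config object.

```python
from types import SimpleNamespace
from typing import Any, Callable, Optional


def ensure_architectures(
    user_override: Optional[Callable[[Any], Any]],
    model_arch: str,
) -> Callable[[Any], Any]:
    """Wrap an optional user override so `architectures` is filled in only if absent.

    Hypothetical helper: no config is loaded here; the check runs on whatever
    config object is later passed to the returned callable.
    """

    def override(hf_config: Any) -> Any:
        if user_override is not None:
            hf_config = user_override(hf_config)
        # Only fill the gap; an existing value in config.json is left alone.
        if not getattr(hf_config, "architectures", None):
            hf_config.architectures = [model_arch]
        return hf_config

    return override


# Stand-in for a config.json that has no `architectures` field.
cfg = ensure_architectures(None, "FishSpeechSlowARForConditionalGeneration")(SimpleNamespace())
```

Whether this works in practice depends on the override being applied before the architecture check runs, which this sketch does not verify.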

linyueqian · 2026-03-25T04:05:32Z

@Sy0307 is this patch necessary? strange that i need to add this for fish speech.

When a model's config.json lacks the `architectures` field (e.g. Fish Speech s2-pro), vLLM 0.18.0's ModelConfig.__init__ raises "No model architectures are specified". The stage config already specifies `model_arch`, but it was only used after ModelConfig creation, which is too late.

Now injects `model_arch` via `hf_overrides` before calling super().create_model_config(). No extra config load needed: hf_overrides merges cleanly when architectures already exists.

Tested with Fish Speech s2-pro and Qwen3-TTS.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Signed-off-by: linyueqian <linyueqian@outlook.com>

gcanlin · 2026-03-25T06:43:29Z

Please see #1898. Looks like we don't need it?

Sy0307 · 2026-03-25T07:18:15Z

> @Sy0307 is this patch necessary? strange that i need to add this for fish speech.

When I checked out vLLM 0.18.0, the same issue occurred. This PR is necessary for me.

gcanlin · 2026-03-25T07:21:05Z

cc @princepride

princepride · 2026-03-25T07:53:31Z

I will check it later.

princepride · 2026-03-25T07:56:14Z

I believe this fix may not solve the real problem: when we have already given the model's architecture, it makes no sense that we would need hf_overrides to override it.

princepride · 2026-03-25T09:41:12Z

The model_arch in the YAML does not take effect during vLLM's base ModelConfig.__init__ phase.

Timing issue:

1. OmniEngineArgs picks up self.model_arch = "FishSpeechSlowARForConditionalGeneration" from the YAML.
2. It calls super().create_model_config() → vLLM's ModelConfig.__init__ runs.
3. ModelConfig.__init__ reads the architecture from hf_config.architectures → at this point model_arch has not been used yet.
4. If the HF config.json has no architectures field → crash.
5. OmniModelConfig.from_vllm_model_config(model_arch=...) is never reached.

model_arch only takes effect from step 5 onward (via the OmniModelConfig.architectures property override + the hf_config.architectures write-back), but because the fishaudio/s2-pro model's config.json is non-standard and lacks an architectures field, the program already crashes at step 4.
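The ordering problem can be reproduced with a toy model. Everything below is a stand-in written for illustration (`ToyModelConfig`, `create_late`, `create_early` are hypothetical and are not vLLM or vllm-omni classes); only the error message text is taken from this thread.

```python
from typing import Any, Dict, Optional


class ToyModelConfig:
    """Stand-in for a base config that validates architectures in __init__."""

    def __init__(self, hf_config: Dict[str, Any],
                 hf_overrides: Optional[Dict[str, Any]] = None) -> None:
        merged = {**hf_config, **(hf_overrides or {})}
        if not merged.get("architectures"):
            # Mirrors the failure point: the check runs during construction.
            raise ValueError("No model architectures are specified")
        self.architectures = merged["architectures"]


def create_late(hf_config: Dict[str, Any], model_arch: str) -> ToyModelConfig:
    """Apply model_arch after construction: too late if the field is missing."""
    cfg = ToyModelConfig(hf_config)      # raises before the next line runs
    cfg.architectures = [model_arch]     # never reached for a bare config
    return cfg


def create_early(hf_config: Dict[str, Any], model_arch: str) -> ToyModelConfig:
    """Pass model_arch in as an override so it is present during validation."""
    return ToyModelConfig(hf_config, {"architectures": [model_arch]})


bare_config: Dict[str, Any] = {}  # a config.json without `architectures`
ok = create_early(bare_config, "FishSpeechSlowARForConditionalGeneration")
```

Calling `create_late(bare_config, ...)` raises inside the constructor, which corresponds to the crash at step 4; `create_early` succeeds because the value is supplied before validation.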

princepride

LGTM

linyueqian · 2026-03-25T20:02:36Z

Closing in favor of #2178 which was already merged with an equivalent fix.

linyueqian requested a review from hsliuustc0106 as a code owner March 25, 2026 03:58

chatgpt-codex-connector Bot reviewed Mar 25, 2026

linyueqian force-pushed the fix/fish-speech-architectures branch from 0fe68c8 to dd0f833 March 25, 2026 04:10

Merge branch 'main' into fix/fish-speech-architectures

7046501

princepride approved these changes Mar 25, 2026

linyueqian added the ready label (label to trigger buildkite CI) Mar 25, 2026

linyueqian enabled auto-merge (squash) March 25, 2026 15:39

linyueqian closed this Mar 25, 2026

auto-merge was automatically disabled March 25, 2026 20:02
Pull request was closed

linyueqian wants to merge 2 commits into vllm-project:main from linyueqian:fix/fish-speech-architectures
