[Fix][Fish Speech] Remove redundant get_vocab() in control token encoding by Sy0307 · Pull Request #2842 · vllm-project/vllm-omni

Sy0307 · 2026-04-16T07:51:37Z

Purpose

_encode_control_token() in prompt_utils.py called tokenizer.get_vocab() on every invocation, which rebuilds the full 155K-entry vocabulary dictionary each time (~68ms on H20 GPU). Since this function is called 6 times per prompt (for <|im_start|>, <|im_end|>, <|voice|>), it adds ~408ms of pure Python overhead to every Fish Speech S2 Pro TTS request.

Replace with tokenizer.convert_tokens_to_ids() which performs the same single-token lookup in <1ms.

Test Plan

A/B benchmark: run baseline (origin/main) and fix on the same H20 GPU with identical config (enforce_eager=false, CUDA graph enabled), same model, same text
Verify audio output is valid WAV with correct content
Ruff lint/format pass

Test Result

Setup: enforce_eager=false (torch.compile + CUDA graph), text = "The quick brown fox jumps over the lazy dog." (~3s audio)

	Median Total	Median RTF	Improvement
Baseline (origin/main)	1.205s	0.399	—
This PR	0.908s	0.292	-25%

Long text (~14s audio):

	Median Total	Median RTF	Improvement
Baseline (origin/main)	3.830s	0.269	—
This PR	3.649s	0.248	-8%

Root cause profiling: build_prompt dropped from ~400ms to ~1ms per request.

cc @linyueqian @zwhzzz0821

…ding tokenizer.get_vocab() rebuilds the full 155K-entry vocab dict on every call (~68ms on H20). _encode_control_token() called it 6 times per prompt, adding ~408ms of pure Python overhead to every Fish Speech TTS request. Replace with convert_tokens_to_ids() which does the same lookup in <1ms. Signed-off-by: Sy03 <1370724210@qq.com>

chatgpt-codex-connector · 2026-04-16T07:51:43Z

Codex usage limits have been reached for code reviews. Please check with the admins of this repo to increase the limits by adding credits.
Credits must be used to enable repository wide code reviews.

hsliuustc0106 · 2026-04-16T09:29:16Z

Blocking Issues

None.

VERDICT: COMMENT

Clean performance optimization. The A/B benchmarking evidence in the PR description is solid. LGTM.

(Note: This change is already covered by existing tests since it's an internal optimization with no API changes.)

linyueqian

lgtm

…ding (vllm-project#2842) Signed-off-by: Sy03 <1370724210@qq.com>

Sy0307 requested a review from hsliuustc0106 as a code owner April 16, 2026 07:51

linyueqian approved these changes Apr 16, 2026

View reviewed changes

linyueqian added the ready label to trigger buildkite CI label Apr 16, 2026

linyueqian enabled auto-merge (squash) April 16, 2026 12:08

linyueqian merged commit 322620f into vllm-project:main Apr 16, 2026
8 checks passed

lvliang-intel pushed a commit to lvliang-intel/vllm-omni that referenced this pull request Apr 20, 2026

[Fix][Fish Speech] Remove redundant get_vocab() in control token enco…

1325caf

…ding (vllm-project#2842) Signed-off-by: Sy03 <1370724210@qq.com>

lengrongfu pushed a commit to lengrongfu/vllm-omni that referenced this pull request May 1, 2026

[Fix][Fish Speech] Remove redundant get_vocab() in control token enco…

a79b2ca

…ding (vllm-project#2842) Signed-off-by: Sy03 <1370724210@qq.com>

clodaghwalsh17 pushed a commit to clodaghwalsh17/nm-vllm-omni-ent that referenced this pull request May 12, 2026

[Fix][Fish Speech] Remove redundant get_vocab() in control token enco…

426b1ac

…ding (vllm-project#2842) Signed-off-by: Sy03 <1370724210@qq.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Fix][Fish Speech] Remove redundant get_vocab() in control token encoding#2842

[Fix][Fish Speech] Remove redundant get_vocab() in control token encoding#2842
linyueqian merged 1 commit into
vllm-project:mainfrom
Sy0307:fix/fish-speech-get-vocab-perf

Sy0307 commented Apr 16, 2026

Uh oh!

chatgpt-codex-connector Bot commented Apr 16, 2026

Uh oh!

hsliuustc0106 commented Apr 16, 2026

Uh oh!

linyueqian left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

Sy0307 commented Apr 16, 2026

Purpose

Test Plan

Test Result

Uh oh!

chatgpt-codex-connector Bot commented Apr 16, 2026

Uh oh!

hsliuustc0106 commented Apr 16, 2026

Blocking Issues

Uh oh!

linyueqian left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants