[Frontend] Add max-completion-token option to transcription/translation endpoints#30769
Conversation
Code Review
This pull request adds a max_completion_tokens option to the transcription and translation endpoints. The implementation in vllm/entrypoints/openai/speech_to_text.py has a critical bug that will cause a TypeError when max_completion_tokens is not provided, and an AttributeError for translation requests. I've provided a suggestion to fix this. To fully support this for translations, max_completion_tokens should also be added to the TranslationRequest protocol. Additionally, a test case for the translation endpoint with this new option would be beneficial.
Hi @NickLucche, the pre-commit checks have failed. Please run:

uv pip install pre-commit
pre-commit install
pre-commit run --all-files

Then, commit the changes and push to your branch.
Signed-off-by: NickLucche <nlucches@redhat.com>
…tion endpoints (vllm-project#30769) Signed-off-by: NickLucche <nlucches@redhat.com>
Straightforward PR to let users cap the number of tokens generated by the STT (speech-to-text) endpoints, using an additional non-OpenAI argument (`max_completion_tokens`).