
[Bugfix] Fix correct error message when len(prompt) + max_tokens > max_model_len#33425

Closed

sducouedic wants to merge 4 commits into vllm-project:main from sducouedic:fix_max_model_len


Conversation

Contributor

@sducouedic sducouedic commented Jan 30, 2026

Closes #33418

Fixes the error message, which displayed the wrong max_model_len when len(prompt) + max_tokens > max_model_len.

  • For vllm instance: vllm serve ./llama-194m --max-model-len 2048
  • Request:
curl -X 'POST' 'http://localhost:8000/v1/completions' \
  -H 'accept: application/json' \
  -H 'Content-Type: application/json' \
  -d '{ "model": "./llama-194m", "prompt": "What is the capital of Paris?", "max_tokens": 2045 }'
    
  • Previously:
    {"error":{"message":"This model's maximum context length is 3 tokens. However, your request has 8 input tokens. Please reduce the length of the input messages. (parameter=input_tokens, value=8)","type":"BadRequestError","param":"input_tokens","code":400}}
    
  • Now:
    {"error":{"message":"This model's maximum context length is 2048 tokens. However, your request has 8 input tokens plus 2045 'max_tokens'. Please reduce one or the other. (parameter=input_tokens, max_tokens, value=(8, 2045))","type":"BadRequestError","param":"input_tokens, max_tokens","code":400}}
    

cc: @yannicks1

Signed-off-by: Sophie du Couédic <sop@zurich.ibm.com>
@mergify mergify bot added the frontend and bug (Something isn't working) labels Jan 30, 2026
Contributor

@gemini-code-assist gemini-code-assist bot left a comment


Code Review

This pull request fixes a misleading error message shown when a request exceeds the model's maximum context length due to a long prompt combined with a large max_tokens value. The previous message reported an incorrect maximum context length; the new one reports the correct limit and identifies both the prompt length and max_tokens as the cause. The change is well implemented and improves the user experience for this validation error. I have no concerns.

Member

@DarkLight1337 DarkLight1337 left a comment


Will be addressed as part of #32863

@mergify

mergify bot commented Jan 31, 2026

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @sducouedic.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

@mergify mergify bot added the needs-rebase label Jan 31, 2026
@DarkLight1337
Member

Closing as superseded by #32863, thanks for your efforts though!

@sducouedic sducouedic deleted the fix_max_model_len branch February 19, 2026 22:09

Labels

bug (Something isn't working), frontend, needs-rebase


Development

Successfully merging this pull request may close these issues.

[Bug]: wrong error reported when len(prompt) + requested tokens > max_context_len

2 participants