Conversation
Code Review
This pull request adds a regression test to ensure that the /tokenize endpoint correctly expands image placeholders for VLM models. The test is well-structured and targets the specific regression. My feedback includes a suggestion to improve the test's robustness to make it less brittle against future changes in the model or tokenizer.
```python
# stop_sign.jpg (1300x876) produces 1451 tokens after expansion.
# Without expansion the count would be ~26 (text + one placeholder).
assert response.json()["count"] == 1451
```
Hardcoding the exact token count `1451` makes this test brittle: minor future changes to the model's tokenizer or image processor could cause it to fail even when the overall functionality is correct. To make the test more robust, consider asserting a range or a minimum value that clearly distinguishes the buggy behavior (~26 tokens) from the correct behavior. For example, `assert response.json()["count"] > 1000` would still effectively catch the regression while being more resilient to small changes.
```diff
-assert response.json()["count"] == 1451
+assert response.json()["count"] > 1000, "Token count is too low, image placeholders were likely not expanded."
```
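One way to harden the assertion in both directions is to bound the count from below and above. The helper and thresholds below are an illustrative sketch, not code from this PR:

```python
def check_expanded_token_count(count: int,
                               min_expected: int = 1000,
                               max_expected: int = 5000) -> None:
    """Assert the token count reflects image-placeholder expansion.

    Without expansion the count would be ~26 (text plus a single
    placeholder token); with expansion it should be well above 1000
    but still within a sane upper bound.
    """
    assert count > min_expected, (
        f"Token count {count} is too low; image placeholders were "
        "likely not expanded."
    )
    assert count < max_expected, f"Token count {count} is unexpectedly high."

# Passes for the correct behavior (1451 tokens); a count of ~26
# from the buggy behavior would raise an AssertionError.
check_expanded_token_count(1451)
```

The upper bound is optional, but it also catches the opposite failure mode, e.g. placeholders being expanded twice.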
Signed-off-by: hallerite <git@hallerite.com>
👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs do not trigger a full CI run by default; only a limited subset of tests runs automatically. You can ask your reviewers to trigger select CI tests on top of that. Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

If you have any questions, please reach out to us on Slack at https://slack.vllm.ai. 🚀
Purpose
Add a regression test verifying that `/tokenize` correctly expands image placeholder tokens for VLM models.

Before PR #34560, `/tokenize` did not run the multimodal processor, so it returned ~26 tokens for a message with an image instead of the expected 1451. Confirmed broken on both 0.15.1 and 0.16.0.

Test Plan
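In outline, the regression test exercises the endpoint roughly as follows. The base URL, model name, and payload shape here are assumptions for illustration; the actual test uses vLLM's own test utilities:

```python
import requests


def tokenize_count(base_url: str, model: str, image_url: str) -> int:
    """POST a chat message containing an image to /tokenize and
    return the token count reported by the server."""
    payload = {
        "model": model,
        "messages": [{
            "role": "user",
            "content": [
                {"type": "text", "text": "What is in this image?"},
                {"type": "image_url", "image_url": {"url": image_url}},
            ],
        }],
    }
    resp = requests.post(f"{base_url}/tokenize", json=payload, timeout=60)
    resp.raise_for_status()
    # The response reports the (expanded) token count in "count".
    return resp.json()["count"]
```

The test then asserts that the returned count is well above the ~26 tokens the buggy code path produced.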
```
pytest tests/entrypoints/openai/test_tokenization_vlm.py -v
```

Test Result
On HEAD — PASS:

```
test_tokenize_chat_expands_image_placeholders PASSED
======================= 1 passed in 30.40s ========================
```
On vllm 0.15.1 / 0.16.0 — FAIL (expected):

```
/tokenize returned 26 tokens
FAIL — token count 26 is too low, image placeholders were NOT expanded
```
Essential Elements of an Effective PR Description Checklist
- `supported_models.md` and `examples` for a new model.