[Misc] Various cleanups for MM input processing #29970
DarkLight1337 merged 7 commits into vllm-project:main
Conversation
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
Documentation preview: https://vllm--29970.org.readthedocs.build/en/29970/
💡 Codex Review
Here are some automated review suggestions for this pull request.
Code Review
This pull request introduces several valuable cleanups for multimodal inputs. The addition of the tensor2base64 utility function is a great improvement, simplifying code that serializes tensors to base64 and reducing duplication across documentation, examples, and tests. The removal of the unused tokenizer argument from parse_chat_messages and parse_chat_messages_futures cleans up the API nicely. I also appreciate the fix for the Terratorch test and the addition of missing type annotations.
I have one minor suggestion to improve the new utility function further. Overall, this is a solid set of changes that improves code quality and maintainability.
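To make the review's point concrete, the following is a minimal sketch of what a tensor-to-base64 helper and its inverse might look like. This is a hypothetical illustration, not the actual vLLM implementation: the real utility lives in `vllm.utils.serial_utils` and presumably operates on torch tensors, while NumPy is used here to keep the example self-contained.

```python
import base64
import io

import numpy as np


def tensor2base64_sketch(arr: np.ndarray) -> str:
    """Serialize an array to a base64 string (hypothetical sketch)."""
    buf = io.BytesIO()
    np.save(buf, arr)  # stand-in for the tensor serialization step
    buf.seek(0)
    binary_data = buf.read()
    return base64.b64encode(binary_data).decode("utf-8")


def base64_to_tensor_sketch(data: str) -> np.ndarray:
    """Inverse helper, useful for round-tripping in tests (also hypothetical)."""
    return np.load(io.BytesIO(base64.b64decode(data)))


# Round-trip example: the decoded array matches the original exactly.
embeds = np.arange(6, dtype=np.float32).reshape(2, 3)
encoded = tensor2base64_sketch(embeds)
decoded = base64_to_tensor_sketch(encoded)
assert np.array_equal(embeds, decoded)
```

Centralizing this pattern in one function is exactly the deduplication benefit the review describes: docs, examples, and tests can call the helper instead of repeating the buffer-and-encode boilerplate.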
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
/gemini review
Code Review
This pull request introduces several cleanups and fixes related to multi-modal inputs. A new convenience function tensor2base64 is added to simplify tensor serialization, and it's consistently used across documentation, examples, and tests. A significant refactoring removes the unused tokenizer argument from MultiModalItemTracker and related functions, which cleans up the API. The PR also includes fixes for incorrect type annotations in multi-modal data parsers and corrects a test that was improperly marked as asynchronous. Overall, these changes improve code quality, consistency, and correctness. The implementation looks solid, and I have no further suggestions.
```python
buf.seek(0)
binary_data = buf.read()

return base64.b64encode(binary_data).decode("utf-8")
```
Should we also use pybase64 here, as introduced in #19984, or do you think it's not worth it for this case?
Let's do that in a separate PR
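For context, pybase64 is a faster C-accelerated drop-in for the standard library's `base64`, exposing the same `b64encode`/`b64decode` signatures. A follow-up along the lines discussed above might look like this sketch, which assumes pybase64 is treated as an optional dependency (the helper name is hypothetical):

```python
import base64

try:
    import pybase64  # faster C implementation, API-compatible
    _b64encode = pybase64.b64encode
except ImportError:
    _b64encode = base64.b64encode  # standard-library fallback


def encode_bytes(binary_data: bytes) -> str:
    # Output is identical either way; only encoding speed differs.
    return _b64encode(binary_data).decode("utf-8")


print(encode_bytes(b"hello"))  # → aGVsbG8=
```

Because the two encoders produce byte-identical output, swapping one for the other is purely a performance change, which is why it can safely be deferred to a separate PR.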
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
Signed-off-by: dsuhinin <suhinin.dmitriy@gmail.com>
Purpose

- Add a `vllm.utils.serial_utils.tensor2base64` convenience function
- `tokenizer=None` in `_preprocess_chat`
- Remove the `tokenizer` argument from the `MultiModalItemTracker` classes, and by extension the `parse_chat_messages` functions

Test Plan
Test Result