(bugfix): Fixed encode in LLM entrypoint for IOProcessor plugin prompts #34618
Merged
vllm-bot merged 1 commit into vllm-project:main on Feb 16, 2026
Conversation
Signed-off-by: Christian Pinto <christian.pinto@ibm.com>
Documentation preview: https://vllm--34618.org.readthedocs.build/en/34618/
Contributor
Code Review
This pull request aims to fix a bug in the LLM.encode method for IO Processor plugin prompts. The change in vllm/entrypoints/llm.py modifies the argument passed to the io_processor.parse_data method. While the change itself seems to align with the PR description, the corresponding changes in the test and example files appear to contradict the fix, potentially making it ineffective. I've added a critical comment detailing this concern.
wzhao18 pushed a commit to wzhao18/vllm that referenced this pull request on Feb 18, 2026
(bugfix): Fixed encode in LLM entrypoint for IOProcessor plugin prompts (vllm-project#34618) Signed-off-by: Christian Pinto <christian.pinto@ibm.com> Signed-off-by: wzhao18 <wzhao18.sz@gmail.com>
eldarkurtic pushed a commit to eldarkurtic/vllm that referenced this pull request on Feb 19, 2026
(bugfix): Fixed encode in LLM entrypoint for IOProcessor plugin prompts (vllm-project#34618) Signed-off-by: Christian Pinto <christian.pinto@ibm.com> Signed-off-by: Eldar Kurtic <research@neuralmagic.com>
ZJY0516 pushed a commit to ZJY0516/vllm that referenced this pull request on Feb 23, 2026
(bugfix): Fixed encode in LLM entrypoint for IOProcessor plugin prompts (vllm-project#34618) Signed-off-by: Christian Pinto <christian.pinto@ibm.com> Signed-off-by: zjy0516 <riverclouds.zhu@qq.com>
llsj14 pushed a commit to llsj14/vllm that referenced this pull request on Mar 1, 2026
(bugfix): Fixed encode in LLM entrypoint for IOProcessor plugin prompts (vllm-project#34618) Signed-off-by: Christian Pinto <christian.pinto@ibm.com>
tunglinwood pushed a commit to tunglinwood/vllm that referenced this pull request on Mar 4, 2026
(bugfix): Fixed encode in LLM entrypoint for IOProcessor plugin prompts (vllm-project#34618) Signed-off-by: Christian Pinto <christian.pinto@ibm.com>
Purpose
This PR fixes a bug in encode in the LLM entrypoint when using IO Processor plugins.
Previously, encode would verify that the request is meant to use an IO Processor plugin and then pass the full prompt to the plugin's parse_data method. Instead, only the prompt.data field should be passed to the plugin, mirroring what is done in online serving mode. This came out of a discussion in #34214.
@DarkLight1337 @staugust
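The shape of the change can be pictured with a minimal toy sketch. The names IOProcessorPrompt, ToyIOProcessor, and the encode wrapper below are illustrative stand-ins, not the actual vllm/entrypoints/llm.py code; only parse_data and the prompt.data field come from the PR text.

```python
from dataclasses import dataclass
from typing import Any


@dataclass
class IOProcessorPrompt:
    """Toy stand-in for a plugin prompt: a payload plus request metadata."""
    data: Any          # the payload the plugin should parse
    request_id: str    # illustrative metadata the plugin must not receive


class ToyIOProcessor:
    """Toy stand-in for an IO Processor plugin."""

    def parse_data(self, data: Any) -> Any:
        # Plugins expect the raw payload, exactly as in online serving mode.
        return data


def encode(prompt: IOProcessorPrompt, io_processor: ToyIOProcessor) -> Any:
    # Buggy behavior before this PR: io_processor.parse_data(prompt)
    # Fixed behavior: forward only the payload field.
    return io_processor.parse_data(prompt.data)
```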
Test Plan
Modified the test to pass a properly formatted prompt to vLLM when performing inference in offline mode.
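For reference, a hypothetical offline-mode call might look like the following. The model name, plugin identifier, and prompt payload are placeholders, and the exact prompt schema is plugin-specific; consult the plugin's documentation for the real values.

```python
from vllm import LLM

# All names below are placeholders, not an endorsement of a specific plugin.
llm = LLM(model="some/pooling-model", io_processor_plugin="my_plugin")

# Per this PR, only the "data" field of the prompt reaches the plugin's
# parse_data on the offline encode path, matching online serving.
prompt = {"data": {"input_url": "https://example.com/input.bin"}}
outputs = llm.encode(prompt)
```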
Test Result