[Frontend] Skip `stop` in reasoning content by chaunceyjiang · Pull Request #24941 · vllm-project/vllm

chaunceyjiang · 2025-09-16T06:23:23Z

Purpose

When the stop="" parameter is set, it causes the reasoning phase to stop when encountering a stop token, which interrupts the process and prevents the user from seeing any content.

Test Plan

from openai import OpenAI

# Modify OpenAI's API key and API base to use vLLM's API server.
openai_api_key = "EMPTY"
openai_api_base = "http://localhost:8000/v1"

client = OpenAI(
    api_key=openai_api_key,
    base_url=openai_api_base,
)

models = client.models.list()
model = models.data[0].id

# Round 1
messages = [{"role": "user", "content": "9.11 and 9.8, which is greater?"}]
response = client.chat.completions.create(
    model=model,
    messages=messages,
    stop="9.8",
)

reasoning_content = response.choices[0].message.reasoning_content
content = response.choices[0].message.content

print("reasoning_content for Round 1:", reasoning_content)
print("-" * 80)
print("content for Round 1:", content)

Test Result

reasoning_content for Round 1: 
Okay, so I need to figure out whether 9.11 is greater than 9.8 or not. Let me start by recalling how to compare decimal numbers. I think the general rule is to look at the digits from left to right and compare them one by one. 

First, both numbers are in the ones place. Let me write them down to visualize better:

9.11 and 9.8
...
...
...
So, in conclusion, 9.8 is greater than 9.11.

--------------------------------------------------------------------------------
content for Round 1: 

To determine which number is greater between **9.11** and **

Essential Elements of an Effective PR Description Checklist

The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
The test plan, such as providing test command.
The test results, such as pasting the results comparison before and after, or e2e results
(Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
(Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

chaunceyjiang · 2025-09-16T07:39:02Z

/cc @mgoin @njhill @aarnphm PTAL.

vllm/v1/engine/detokenizer.py

mergify · 2025-09-17T09:52:37Z

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @chaunceyjiang.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

mergify · 2025-09-21T01:07:29Z

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @chaunceyjiang.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

chaunceyjiang · 2025-09-22T09:20:52Z

/cc @njhill PTAL.

gaocegege · 2025-09-26T03:18:54Z

I think this only addresses part of the performance problem though. For the other part, I think changes would be needed on the reasoning parser side. There should be a way of doing incremental checking, rather than searching the entire prompt + output tokens for every generated token (while reasoning)...

@njhill I agree with your point. I’ll look into this issue and then submit a new PR. Currently, is_reasoning_end is used in multiple places.

I’ll explore how to perform incremental checks.

Could you please create an issue to keep track? It is a potential problem since day 1, I am also interested in it.

mergify · 2025-10-01T23:31:16Z

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @chaunceyjiang.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

chaunceyjiang · 2025-10-09T02:40:23Z

I think this only addresses part of the performance problem though. For the other part, I think changes would be needed on the reasoning parser side. There should be a way of doing incremental checking, rather than searching the entire prompt + output tokens for every generated token (while reasoning)...

Hi, @gaocegege @njhill PTAL. #25735

Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>

mergify · 2025-10-13T09:49:27Z

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @chaunceyjiang.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

bbartels · 2025-11-12T15:03:52Z

@chaunceyjiang is there something still missing from having this merged?

mergify · 2025-12-05T14:05:21Z

Hi @chaunceyjiang, the pre-commit checks have failed. Please run:

uv pip install pre-commit
pre-commit install
pre-commit run --all-files

Then, commit the changes and push to your branch.

For future commits, pre-commit will run automatically on changed files before each commit.

github-actions · 2026-03-06T02:34:32Z

This pull request has been automatically marked as stale because it has not had any activity within 90 days. It will be automatically closed if no further activity occurs within 30 days. Leave a comment if you feel this pull request should remain open. Thank you!

mergify bot added v1 tool-calling labels Sep 16, 2025

github-project-automation bot added this to Tool Calling Sep 16, 2025

chaunceyjiang marked this pull request as ready for review September 16, 2025 07:35

chaunceyjiang requested review from DarkLight1337, NickLucche, WoosukKwon, aarnphm, alexm-redhat, comaniac, njhill, robertgshaw2-redhat, simon-mo and ywang96 as code owners September 16, 2025 07:35

chaunceyjiang changed the title ~~[Frontend] Skip stop in reasoning content~~ [Frontend] Skip stop in reasoning content Sep 16, 2025

chaunceyjiang changed the title ~~[Frontend] Skip stop in reasoning content~~ [Frontend] Skip stop in reasoning content Sep 16, 2025

chaunceyjiang changed the title ~~[Frontend] Skip stop in reasoning content~~ [Frontend] Skip stop in reasoning content Sep 16, 2025

njhill reviewed Sep 17, 2025

View reviewed changes

vllm/v1/engine/detokenizer.py Outdated Show resolved Hide resolved

vllm/v1/engine/detokenizer.py Outdated Show resolved Hide resolved

vllm/v1/engine/detokenizer.py Outdated Show resolved Hide resolved

mergify bot added the needs-rebase label Sep 17, 2025

chaunceyjiang force-pushed the stop branch from 87c1d60 to 64b64de Compare September 17, 2025 09:57

mergify bot removed the needs-rebase label Sep 17, 2025

chaunceyjiang requested a review from njhill September 17, 2025 12:56

chaunceyjiang mentioned this pull request Sep 18, 2025

[Frontend] Skip stop in reasoning content #14550

Merged

mergify bot added the needs-rebase label Sep 21, 2025

chaunceyjiang force-pushed the stop branch from 64b64de to a2268a6 Compare September 22, 2025 03:40

mergify bot removed the needs-rebase label Sep 22, 2025

mergify bot added the needs-rebase label Oct 1, 2025

chaunceyjiang added 17 commits October 11, 2025 02:42

[Frontend] Skip stop in reasoning content

7be4cfc

Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>

[Frontend] Skip stop in reasoning content

c39e872

Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>

[Frontend] Skip stop in reasoning content

cceda3f

Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>

[Frontend] Skip stop in reasoning content

e440646

Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>

[Frontend] Skip stop in reasoning content

47ec813

Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>

[Frontend] Skip stop in reasoning content

f5e798f

Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>

[Frontend] Skip stop in reasoning content

2eb823a

Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>

[Frontend] Skip stop in reasoning content

188933f

Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>

[Frontend] Skip stop in reasoning content

5f18ec5

Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>

[Frontend] Skip stop in reasoning content

2968928

Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>

[Frontend] Skip stop in reasoning content

2bf8278

Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>

[Frontend] Skip stop in reasoning content

db9ad2b

Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>

[Frontend] Skip stop in reasoning content

e2d34f6

Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>

[Frontend] Skip stop in reasoning content

880ac27

Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>

[Frontend] Skip stop in reasoning content

9f8a4f0

Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>

[Frontend] Skip stop in reasoning content

6343d1b

Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>

[Frontend] Skip stop in reasoning content

9ad8c67

Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>

chaunceyjiang force-pushed the stop branch from e217fec to 9ad8c67 Compare October 11, 2025 02:47

mergify bot removed the needs-rebase label Oct 11, 2025

gaocegege approved these changes Oct 13, 2025

View reviewed changes

mergify bot added the needs-rebase label Oct 13, 2025

github-actions bot added the stale Over 90 days of inactivity label Mar 6, 2026

Uh oh!

Conversation

chaunceyjiang commented Sep 16, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Test Plan

Test Result

Uh oh!

chaunceyjiang commented Sep 16, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

mergify bot commented Sep 17, 2025

Uh oh!

mergify bot commented Sep 21, 2025

Uh oh!

chaunceyjiang commented Sep 22, 2025

Uh oh!

gaocegege commented Sep 26, 2025

Uh oh!

mergify bot commented Oct 1, 2025

Uh oh!

chaunceyjiang commented Oct 9, 2025

Uh oh!

mergify bot commented Oct 13, 2025

Uh oh!

bbartels commented Nov 12, 2025

Uh oh!

mergify bot commented Dec 5, 2025

Uh oh!

github-actions bot commented Mar 6, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

chaunceyjiang commented Sep 16, 2025 •

edited by github-actions bot

Loading