[Sampler] Support returning all logprobs or logits #21792
vllm-bot merged 8 commits into vllm-project:main
Conversation
Signed-off-by: 22quinn <33176974+22quinn@users.noreply.github.com>
Code Review
This pull request introduces support for returning all logprobs or logits by setting logprobs=-1. The changes are well-contained and include updates to configuration, validation, and worker logic, along with a new test case.
My review identified a critical bug in the validation logic. The current implementation would incorrectly allow a request with logprobs=-1 even when the engine has a max_logprobs limit set, which could lead to unexpected resource consumption. I've provided a code suggestion to fix this validation logic. The rest of the implementation appears correct and effectively adds the desired functionality.
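A hedged sketch of the kind of cap check the review is describing; the function and argument names here are illustrative, not the actual vLLM code. The assumed convention is that `-1` means "full vocabulary" and should only be accepted when the engine itself was started uncapped (`max_logprobs=-1`):

```python
def validate_logprobs(requested_logprobs, max_logprobs):
    """Validate a request's `logprobs` against the engine-wide cap.

    Assumed convention: -1 means "return logprobs for the entire
    vocabulary" and is only permitted when the engine runs with
    max_logprobs=-1 (no cap). Otherwise -1 must be rejected, which is
    the case the review flagged as missing.
    """
    if requested_logprobs is None:
        return  # logprobs were not requested at all
    if max_logprobs == -1:
        return  # engine has no cap; any value, including -1, is allowed
    if requested_logprobs == -1 or requested_logprobs > max_logprobs:
        raise ValueError(
            f"Requested logprobs={requested_logprobs} exceeds the "
            f"engine limit max_logprobs={max_logprobs}."
        )
```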
cc @njhill
houseroad
left a comment
why do we want to return all the logits? This can easily run into OOM, right?
njhill
left a comment
This looks reasonable to me but would be good to get some more opinions.
why do we want to return all the logits? This can easily run into OOM, right?
@houseroad yes I think it could cause significant performance problems in a multi-user production scenario but imo it's ok to support this for experimental purposes given that it needs to be explicitly enabled via the config.
Yeah, this is not for production, but it's nice to provide an option for power users to get everything they want (e.g., for debugging).
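To make the OOM concern concrete, a rough back-of-the-envelope estimate; the vocabulary size, dtype, and batch numbers below are illustrative assumptions, not measurements from this PR:

```python
# Rough memory estimate for returning full-vocab logprobs on every token.
vocab_size = 128_256          # e.g. a Llama-3-style vocabulary (assumption)
bytes_per_value = 4           # float32
tokens_in_flight = 8 * 2048   # 8 concurrent requests x 2048 generated tokens

bytes_total = vocab_size * bytes_per_value * tokens_in_flight
print(f"{bytes_total / 1e9:.1f} GB")  # → 8.4 GB
```

At roughly 0.5 MB per token, full-vocab output dwarfs the usual top-k logprobs, which is why an explicit opt-in makes sense.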
houseroad
left a comment
Sure, I am fine with the opt-in solution then. :-)
vllm/config.py (outdated)
    """Maximum number of log probabilities to return when `logprobs` is
    specified in `SamplingParams`. The default value comes from the default
    for the OpenAI Chat Completions API. -1 means no cap."""
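A hedged usage sketch of the opt-in described by this docstring; the `--max-logprobs` flag name is inferred from the config field, and the model name is a placeholder — verify against `vllm serve --help` for your version:

```shell
# Opt in to uncapped logprobs at engine startup (-1 = no cap; can easily OOM).
vllm serve meta-llama/Llama-3.1-8B-Instruct --max-logprobs -1
```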
Let's add a comment to explicitly call out that -1 is likely to cause OOM?
sounds good, updated
Essential Elements of an Effective PR Description Checklist
`supported_models.md` and `examples` for a new model.

Purpose
Often, for numerical benchmarking and debugging purposes, we need to get all the logits or logprobs (controlled by `logprobs_mode`). This PR supports returning all of them, of `vocab_size` length, if `SamplingParams.logprobs=-1` and `ModelConfig.max_logprobs=-1`.

Test Plan
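For reference, "all logprobs" here means the log-softmax over the full vocabulary at each decoding step. A tiny dependency-free illustration with toy logits (this is not vLLM's implementation, just the math being exposed):

```python
import math

def full_vocab_logprobs(logits):
    """Numerically stable log-softmax over the entire vocabulary."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_z for x in logits]

# Toy 3-token "vocabulary"; a real model would return vocab_size values.
logits = [2.0, 1.0, 0.1]
logprobs = full_vocab_logprobs(logits)
# Probabilities recovered from the logprobs sum to 1.
print(round(sum(math.exp(lp) for lp in logprobs), 6))  # → 1.0
```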
Test Result
Test passed
(Optional) Documentation Update