-
Notifications
You must be signed in to change notification settings - Fork 2.8k
feature: Configurable num logprobs in vLLM generation #5107
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
LeonEricsson
merged 14 commits into
huggingface:main
from
LeonEricsson:feat/vllm_logprobs_topk
Feb 25, 2026
Merged
Changes from all commits
Commits
Show all changes
14 commits
Select commit
Hold shift + click to select a range
5c2e5ce
configurable logprobs to vllm generation
LeonEricsson e301a9d
fix: RLOO unpack values from vllm generate
LeonEricsson cb71e1e
remove test variable
LeonEricsson b2f3463
squeeze out inner-dim in grpo trainer
LeonEricsson 5da44c6
sort by rank + clarify docstrings on num_logprobs return
LeonEricsson feba256
Merge branch 'main' into feat/vllm_logprobs_topk
qgallouedec 26ffad7
allow None + revert #4848
qgallouedec 202b4d1
test + revert #4848
qgallouedec a10fd14
style + explicit value for logprobs in trainers
qgallouedec 72a1d1e
fix: use string literal for RequestOutput in extract_logprobs function
qgallouedec 5d67f82
revert extra field change
qgallouedec b166440
Merge branch 'main' into feat/vllm_logprobs_topk
qgallouedec f3c9380
style
qgallouedec d967536
Merge branch 'main' into feat/vllm_logprobs_topk
LeonEricsson File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
where is the implementation of the change from list[list[float]] to list[list[list[float]]]? down on line 293, this function is still just getting it out of the response with no modifications.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
it's on the server side
trl/trl/scripts/vllm_serve.py
Lines 647 to 654 in 410c00b
the client just passes it through