regression fixes and tests for fix-5119-5121 branch by rycerzes · Pull Request #1 · albertvillanova/trl

rycerzes · 2026-02-19T19:19:53Z

Pushed commits on top.
The main fix is server-mode chat routing: openenv/utils.py now calls chat() instead of generate(), so the vLLM server actually applies the chat template, with chat_template_kwargs, tools, and chat_template threaded through.
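For readers who have not seen the diff, the routing change can be sketched roughly as follows. This is a minimal illustration, not the code in openenv/utils.py: `RecordingClient`, `is_conversational`, and `server_generate` are hypothetical names, and dispatching on the prompt shape is an assumption made only for the example.

```python
class RecordingClient:
    """Hypothetical stand-in for a vLLM server client; it only records calls."""

    def __init__(self):
        self.calls = []

    def generate(self, prompts, **kwargs):
        # Raw text path: the server applies no chat template here.
        self.calls.append(("generate", prompts, kwargs))
        return ["<completion>"] * len(prompts)

    def chat(self, messages, **kwargs):
        # Chat path: the server is expected to apply the chat template itself.
        self.calls.append(("chat", messages, kwargs))
        return ["<completion>"] * len(messages)


def is_conversational(prompt):
    """True if the prompt looks like a list of {"role": ..., "content": ...} messages."""
    return isinstance(prompt, list) and all(
        isinstance(m, dict) and "role" in m for m in prompt
    )


def server_generate(client, prompts, chat_template_kwargs=None, tools=None, chat_template=None):
    """Route conversational prompts to chat(), threading the template options through."""
    if prompts and all(is_conversational(p) for p in prompts):
        return client.chat(
            prompts,
            chat_template_kwargs=chat_template_kwargs or {},
            tools=tools,
            chat_template=chat_template,
        )
    # Assumption for this sketch: plain string prompts keep using generate().
    return client.generate(prompts)
```

The point of the sketch is that the template-related options only have an effect if they reach the call that actually applies the template.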

Also added a ValueError at init when both rollout_func and tools are set (the tool-call loop was silently passing assembled histories to _generate_single_turn), clarified the rollout_func docstring around the no-flattening contract, and added new tests, including one that passes a PIL Image through the full pipeline. This confirms the fix for huggingface#5120.
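The init-time guard amounts to something like the sketch below. `TrainerSketch` is a hypothetical class used only for illustration; the error message and the choice to treat an empty tools list as "not set" are assumptions, not the wording or behavior of the actual trainer.

```python
class TrainerSketch:
    """Hypothetical trainer stub showing only the mutual-exclusion check."""

    def __init__(self, rollout_func=None, tools=None):
        # Assumption: an empty tools list counts as "no tools" in this sketch.
        if rollout_func is not None and tools:
            raise ValueError(
                "rollout_func and tools cannot be used together: the tool-call loop "
                "would pass assembled histories through the custom rollout."
            )
        self.rollout_func = rollout_func
        self.tools = tools
```

Failing at construction time makes the conflict visible before any generation runs, instead of letting the tool-call loop route through rollout_func silently.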


rycerzes added 2 commits February 19, 2026 23:41

fix rollout_func dispatch and server-mode chat routing

e822277

- add TestGRPORolloutDispatch tests: no extra fields, vLLM sync guard when step unchanged

prevent simultaneous use of rollout_func and tools

211dd5a

- fix side effect where moving dispatch into _generate_single_turn caused _tool_call_loop to inadvertently route through rollout_func

rycerzes mentioned this pull request Feb 19, 2026

Decouple rollout dispatch from vLLM backend in GRPO _generate_single_turn huggingface/trl#5122

Open

rycerzes force-pushed the fix-5119-5121 branch 3 times, most recently from 0af8842 to 92848db Compare February 21, 2026 16:14

rycerzes added 2 commits February 23, 2026 12:59

rollout_func docstring for structured prompt formats

f7bb707

tests for conversational prompts pass through without apply_chat_template, plain string prompts pass through unchanged

test for structured multimodal messages in rollout_func

9db804a

rycerzes force-pushed the fix-5119-5121 branch from 92848db to 9db804a Compare February 23, 2026 07:29

rycerzes wants to merge 4 commits into albertvillanova:fix-5119-5121 from rycerzes:fix-5119-5121
