Skip to content

Comments

regression fixes and tests for fix-5119-5121 branch#1

Open
rycerzes wants to merge 4 commits intoalbertvillanova:fix-5119-5121from
rycerzes:fix-5119-5121
Open

regression fixes and tests for fix-5119-5121 branch#1
rycerzes wants to merge 4 commits intoalbertvillanova:fix-5119-5121from
rycerzes:fix-5119-5121

Conversation

@rycerzes
Copy link

@rycerzes rycerzes commented Feb 19, 2026

Pushed commits on top.
Main fix is server-mode chat routing, generate() -> chat() in openenv/utils.py so the vLLM server actually applies the template, with chat_template_kwargs, tools, and chat_template threaded through.

Also added a ValueError at init when both rollout_func and tools are set (tool-call loop was silently passing assembled histories to _generate_single_turn), clarified the rollout_func docstring around the no-flattening contract, and new tests including a PIL Image through the full pipeline. This confirms fix for huggingface#5120

- add TestGRPORolloutDispatch tests: no extra fields, vLLM sync guard when step unchanged
- fix side effect where moving dispatch into  _generate_single_turn caused _tool_call_loop to inadvertently route through rollout_func
tests for conversational prompts pass through without
apply_chat_template, plain string prompts pass through unchanged
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant