Skip to content

GRPOTrainer/async: fix prefix EOS slicing for tool suffix (with Qwen3/3.5 type of chat templates)#5330

Merged
qgallouedec merged 12 commits into
huggingface:mainfrom
casinca:qwen3.5-gen-fix
Mar 21, 2026
Merged

GRPOTrainer/async: fix prefix EOS slicing for tool suffix (with Qwen3/3.5 type of chat templates)#5330
qgallouedec merged 12 commits into
huggingface:mainfrom
casinca:qwen3.5-gen-fix

Commits

Commits on Mar 20, 2026

Commits on Mar 21, 2026