Align use of vllm_max_model_length in RLOOTrainer by albertvillanova · Pull Request #4702 · huggingface/trl

albertvillanova · 2025-12-16T12:06:03Z

Align use of vllm_max_model_length in RLOOTrainer.

Align it with GRPOTrainer, introduced in PR:

🕵️‍♂️ GRPO: Agent training #4300

This alignment is required to make simpler this PR:

Refactor vLLM generation [1/N]: Extract vLLM generation #4700

HuggingFaceDocBuilderDev · 2025-12-16T12:08:30Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

qgallouedec · 2025-12-16T17:20:05Z

trl/trainer/rloo_config.py

            Control the GPU memory utilization for vLLM. This setting only applies when `vllm_mode` is set to
            `"colocate"`. If you are using `vllm_mode="server"`, this parameter must be passed separately when
            launching the vLLM server via the `--vllm_gpu_memory_utilization` flag.
+        vllm_max_model_length (`int`, *optional*, defaults to `None`):


Suggested change

vllm_max_model_length (`int`, *optional*, defaults to `None`):

vllm_max_model_length (`int`, *optional*):

can you possibly apply the same in grpo config?

qgallouedec

lgtm!

Align use of vllm_max_model_length

68b0dc0

albertvillanova mentioned this pull request Dec 16, 2025

Refactor vLLM generation [1/N]: Extract vLLM generation #4700

Merged

albertvillanova mentioned this pull request Dec 16, 2025

Deprecate max_prompt_length in RLOOTrainer #4703

Merged

qgallouedec reviewed Dec 16, 2025

View reviewed changes

Remove redundant type hint

00d631d

qgallouedec approved these changes Dec 16, 2025

View reviewed changes

albertvillanova merged commit 61c9921 into huggingface:main Dec 16, 2025
8 of 9 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Align use of vllm_max_model_length in RLOOTrainer#4702

Align use of vllm_max_model_length in RLOOTrainer#4702
albertvillanova merged 2 commits intohuggingface:mainfrom
albertvillanova:align-vllm-max-model-length

albertvillanova commented Dec 16, 2025

Uh oh!

HuggingFaceDocBuilderDev commented Dec 16, 2025

Uh oh!

qgallouedec Dec 16, 2025

Uh oh!

qgallouedec left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

	vllm_max_model_length (`int`, optional, defaults to `None`):
	vllm_max_model_length (`int`, optional):

Conversation

albertvillanova commented Dec 16, 2025

Uh oh!

HuggingFaceDocBuilderDev commented Dec 16, 2025

Uh oh!

qgallouedec Dec 16, 2025

Choose a reason for hiding this comment

Uh oh!

qgallouedec left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants