Conversation

ehsk (Collaborator) commented Dec 23, 2025

This PR upgrades vLLM from 0.8.5.post1 to 0.11.2. Other notable upgrades resulting from this change: torch to 2.9.0, transformers to 4.57.x, and flash-attention to 2.8.3.

The vLLM upgrade is needed for Apriel multi-modal training (#111), for using new tool parsers, and for supporting newer models.

For weight updates in vLLM v1, I followed https://github.com/vllm-project/vllm/blob/v0.11.2/examples/offline_inference/rlhf_utils.py.
I also found similar code in TRL.
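
For context, a condensed sketch of that pattern (following vLLM's `examples/offline_inference/rlhf_utils.py`; exact signatures may vary across vLLM versions):

```python
import torch


def stateless_init_process_group(master_address, master_port, rank,
                                 world_size, device):
    # Build a NCCL group spanning the trainer (rank 0) and all vLLM workers,
    # without touching torch.distributed's global state.
    from vllm.distributed.device_communicators.pynccl import PyNcclCommunicator
    from vllm.distributed.utils import StatelessProcessGroup
    pg = StatelessProcessGroup.create(host=master_address, port=master_port,
                                      rank=rank, world_size=world_size)
    return PyNcclCommunicator(pg, device=device)


class WorkerExtension:
    """Mixed into the vLLM worker via the worker_extension_cls engine arg."""

    def init_weight_update_group(self, master_address, master_port,
                                 rank_offset, world_size):
        from vllm.distributed.parallel_state import get_world_group
        rank = get_world_group().rank + rank_offset
        self.model_update_group = stateless_init_process_group(
            master_address, master_port, rank, world_size, self.device)

    def update_weight(self, name, dtype, shape):
        # Receive one parameter broadcast from the trainer (rank 0) and load
        # it into the live model.
        weight = torch.empty(shape, dtype=dtype, device="cuda")
        self.model_update_group.broadcast(weight, src=0,
                                          stream=torch.cuda.current_stream())
        self.model_runner.model.load_weights(weights=[(name, weight)])
        del weight
```

On the trainer side, the example iterates over named parameters, calls `llm.collective_rpc("update_weight", args=(name, p.dtype, p.shape))`, and broadcasts each tensor from rank 0 over the same group.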

ehsk requested a review from rafapi on Dec 23, 2025 at 14:57
ehsk commented Jan 18, 2026

The results don't match the old vLLM:

GSPO (blue = v0)

[Figures: Reward, Entropy, AIME'24, and MATH-500 training curves]

GRPO (orange = v0)

[Figures: Reward, Entropy, AIME'24, and MATH-500 training curves]

A main difference between vLLM v0 and v1: in v0, new requests are blocked until a weight-update request is fulfilled, whereas in v1, new requests proceed during a weight update and can be served with a mix of old and new weights.
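
If v0-style blocking is needed on top of v1, one option is to gate requests in the serving layer so a weight update only runs once in-flight generations have drained. A minimal asyncio sketch (all names here are hypothetical, not vLLM APIs):

```python
import asyncio


class WeightUpdateGate:
    """Blocks new generation requests while a weight update is in flight,
    approximating vLLM v0's blocking behavior on top of v1."""

    def __init__(self):
        self._lock = asyncio.Lock()     # held for the duration of an update
        self._inflight = 0
        self._idle = asyncio.Event()
        self._idle.set()

    async def generate(self, engine_generate, *args, **kwargs):
        async with self._lock:          # wait here if an update is running
            self._inflight += 1
            self._idle.clear()
        try:
            return await engine_generate(*args, **kwargs)
        finally:
            self._inflight -= 1
            if self._inflight == 0:
                self._idle.set()

    async def update_weights(self, do_update):
        async with self._lock:          # stop new requests from starting
            await self._idle.wait()     # drain in-flight requests first
            await do_update()           # no request sees mixed weights
```

In a synchronous RL loop the same effect falls out naturally from strictly alternating generation and weight updates.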

The logprobs are the same at the beginning but start to diverge (blue and green are v0):

[Figure: logprob curves over training]
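
One plausible way to track this is the mean log-probability of the sampled tokens per step; a hypothetical helper (the name and shapes are my assumption, not from the repo):

```python
import torch


def mean_sampled_logprob(logprobs: torch.Tensor, token_ids: torch.Tensor) -> float:
    """Average log-probability of the tokens that were actually sampled.

    logprobs:  [num_tokens, vocab_size] log-softmax over the vocabulary
    token_ids: [num_tokens] ids of the sampled tokens
    """
    picked = logprobs.gather(-1, token_ids.unsqueeze(-1)).squeeze(-1)
    return picked.mean().item()
```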

Leaving this PR open for now. Instead, we'll upgrade vLLM to a recent version that still uses v0; see #122.

ehsk mentioned this pull request on Jan 19, 2026
ehsk closed this pull request by merging all changes into main in 64073e3 on Jan 21, 2026