Use standalone_compile by default in torch >= 2.8.0 (#18846)
houseroad merged 1 commit into vllm-project:main
Conversation
vllm/envs.py
Outdated
Wondering why there is TEST in the name?
Also do we want to turn on the flag by default?
The flag was already named VLLM_TEST_STANDALONE_COMPILE in a previous PR. I can rename it if you want, to something like VLLM_USE_STANDALONE_COMPILE, but I was following the naming of VLLM_TEST_DYNAMO_FULLGRAPH_CAPTURE.
This PR turns the flag on by default.
Yeah, let's use something like VLLM_USE_STANDALONE_COMPILE? I feel VLLM_TEST_STANDALONE_COMPILE is misleading, like some flag used in the test only.
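Assuming the rename discussed above goes through, opting out of the new default would look something like this sketch (the flag name follows the suggestion in this thread; the echo is only there to show the value that a vLLM launch would pick up):

```shell
# Illustrative only: with the renamed flag, a user can still explicitly
# opt out of standalone_compile by setting it to 0 before launching vLLM.
export VLLM_USE_STANDALONE_COMPILE=0
echo "standalone compile flag: ${VLLM_USE_STANDALONE_COMPILE}"
```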
Force-pushed from 0c6e52b to d8e7241
houseroad left a comment
Looks good, thanks for updating the env var name.
vllm/envs.py
Outdated
Does internal mean meta internal? If so, it's not internal anymore :-)
internal does not mean "meta internal", it means "not public API". I'll drop the "internal" word.
Force-pushed from d8e7241 to e239a1a
This includes the current PyTorch nightlies. It also renames the VLLM_TEST_STANDALONE_COMPILE envvar to VLLM_USE_STANDALONE_COMPILE to make it clearer.

Test Plan:
- In vllm-project#17057, I verified that running https://gist.github.com/zou3519/aebb622714e80f4cd4c369472f2372cd with or without VLLM_TEST_STANDALONE_COMPILE resulted in Inductor producing exactly the same output code (via tlparse). I did this for both the cold-start and the warm-start case.
- There are vllm x torch nightly tests in CI that I will trigger on this PR.

Signed-off-by: rzou <zou3519@gmail.com>
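The change this PR describes amounts to a version-gated default: the flag is on for torch >= 2.8.0 (which includes current nightlies) but can still be overridden via the env var. A minimal sketch of that pattern is below; only the VLLM_USE_STANDALONE_COMPILE name comes from this PR, while the helper functions and version-parsing logic are illustrative assumptions, not vLLM's actual implementation:

```python
import os

def _release_parts(version: str) -> tuple:
    # Parse the leading numeric release components of a version string,
    # stopping at the first non-numeric piece so nightly suffixes like
    # "2.9.0.dev20250601" compare as (2, 9, 0). Illustrative helper.
    nums = []
    for piece in version.split("."):
        digits = ""
        for ch in piece:
            if ch.isdigit():
                digits += ch
            else:
                break
        if not digits:
            break
        nums.append(int(digits))
    return tuple(nums)

def use_standalone_compile(torch_version: str) -> bool:
    # Default on for torch >= 2.8.0 (nightlies included); the env var
    # still allows an explicit opt-out or opt-in. Hypothetical sketch.
    default = "1" if _release_parts(torch_version) >= (2, 8, 0) else "0"
    return os.environ.get("VLLM_USE_STANDALONE_COMPILE", default) == "1"
```

With this shape, older torch versions keep the previous behavior by default, while setting VLLM_USE_STANDALONE_COMPILE=0 disables standalone_compile even on 2.8+.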
Force-pushed from e239a1a to ec26191
Cc @jerryzh168, we should make sure that we don't land the change disabling the compile cache for 2.8+.
Btw, I thought I changed the commit message, but it looks like I forgot to, or it didn't update the PR body. The envvar was renamed to VLLM_USE_STANDALONE_COMPILE in this PR.
Signed-off-by: rzou <zou3519@gmail.com> Signed-off-by: amit <amit.man@gmail.com>
* use 2025.1.1 instead (vllm-project#196)
* Use standalone_compile by default in torch >= 2.8.0 (vllm-project#18846)
* fix xpu compile issue

Signed-off-by: Kunshang Ji <kunshang.ji@intel.com>
Signed-off-by: rzou <zou3519@gmail.com>
Co-authored-by: Richard Zou <zou3519@users.noreply.github.com>