[Hardware][TPU] Add supports_async_scheduling() method to Executor interface so that it can be extended for Executor implementations (#36924)
Conversation
…can be extended for different Platforms Signed-off-by: Guangxiang Du <gxd@google.com>
Code Review
This pull request introduces a new executors_supports_async_scheduling() method to the Platform interface, which allows different platforms to specify which distributed executor backends are compatible with asynchronous scheduling. This change replaces a hardcoded list of executors in VllmConfig with a call to this new method, making the system more extensible for platforms like TPU. The default implementation preserves the existing behavior, and a unit test has been added to verify this. The changes are well-structured and correctly implemented.
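The pattern the review describes — a platform-level method returning the executor backends compatible with async scheduling, replacing a hardcoded list in VllmConfig — can be sketched roughly as follows. This is an illustrative sketch only: the class names, the `tpu_custom` backend string, and the helper `async_scheduling_allowed` are hypothetical stand-ins, not vLLM's actual API; only the `"uni"` and `"external_launcher"` backend names come from the PR's diff.

```python
# Hypothetical sketch of the extension point described in the review.
# Platform, TpuPlatform, "tpu_custom", and async_scheduling_allowed are
# illustrative names, not vLLM's real classes.
from typing import Tuple


class Platform:
    """Base platform: the default preserves the previously hardcoded
    tuple of backends that support async scheduling."""

    def executors_supports_async_scheduling(self) -> Tuple[str, ...]:
        return ("uni", "external_launcher")


class TpuPlatform(Platform):
    """A platform like TPU can extend the default with its own backend."""

    def executors_supports_async_scheduling(self) -> Tuple[str, ...]:
        return super().executors_supports_async_scheduling() + ("tpu_custom",)


def async_scheduling_allowed(platform: Platform, backend: str) -> bool:
    # VllmConfig-style check: consult the platform instead of a
    # hardcoded list of executor backends.
    return backend in platform.executors_supports_async_scheduling()
```

Under this sketch, the default platform rejects unknown backends while a TPU platform can opt its custom executor in without touching VllmConfig.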
👋 Hi! Thank you for contributing to the vLLM project. 💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels. Just a reminder: PRs do not trigger a full CI run by default; only a small subset of checks runs automatically, and you can ask your reviewers to trigger select CI tests on top of that. Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging. If you have any questions, please reach out to us on Slack at https://slack.vllm.ai. 🚀
I understand your requirement and what you want; it just feels a little strange to me.
Thank you, Kunshang, for the quick review!
…ew comment suggested Signed-off-by: Guangxiang Du <gxd@google.com>
Force-pushed from 29a415c to 2e1ca56
vllm/config/vllm.py
Outdated
    "uni",
    "external_launcher",
)
executor_supports_async_sched = Executor.get_class(
I am not certain whether we should do this here.
cc @njhill @LucasWilkinson PTAL
Yeah it is kind of circular but I'm not sure what the alternative is.
Hi @mgoin @robertgshaw2-redhat! Could you take a look? This is blocking TPU multi-host perf optimization.
Signed-off-by: Guangxiang Du <gxd@google.com>
Thanks, Nick, for the review!
I added the run-full-CI label, in case any corner case breaks.
…terface so that it can be extended for Executor implementations. (vllm-project#36924) Signed-off-by: Guangxiang Du <gxd@google.com>
[Hardware][TPU] Add supports_async_scheduling() method to Executor interface so that it can be extended for Executor implementations
Purpose
TPU-inference has a custom Executor class that we want to make compatible with async scheduling.
In TPU-inference's custom Executor implementation, we plan to override supports_async_scheduling() to return True.
Test Plan
This PR should have no behavior change.
Added a unit test.
Test Result
Unit test passed.
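The override the description plans for TPU-inference's custom Executor can be sketched as below. This is a minimal sketch under assumptions: the base `Executor` shown here is a stand-in (its default of False is illustrative, not necessarily vLLM's actual default), and `TpuCustomExecutor` is a hypothetical name; only the method name `supports_async_scheduling()` comes from the PR.

```python
# Illustrative stand-in for the Executor interface; not vLLM's real class.
class Executor:
    @classmethod
    def supports_async_scheduling(cls) -> bool:
        # Default shown here is conservative (False) for illustration only.
        return False


class TpuCustomExecutor(Executor):
    """TPU-inference's custom Executor would override the method to opt in
    to async scheduling, as the PR description plans."""

    @classmethod
    def supports_async_scheduling(cls) -> bool:
        return True
```

Because the check is a classmethod on the Executor subclass, a platform's custom executor opts in without any change to core config code.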