[Refactor][TPU] Remove torch_xla path and use tpu-inference by weiyu0824 · Pull Request #30808 · vllm-project/vllm

weiyu0824 · 2025-12-16T18:32:26Z

Purpose

Removes torch_xla related code paths as this backend is now deprecated. To run vLLM on TPU, users should now install and use tpu-inference.

Test Plan

Triggered tpu-inference CI/CD pipeline.
Verified that removal does not impact non-TPU backends.

Test Result

tpu-inference CI/CD: https://buildkite.com/tpu-commons/tpu-inference-ci/builds/71

What has been removed

Classes already migrated to tpu-inference, which were previously guarded by USE_TPU_INFERENCE.
Functions that explicitly used torch_xla.
All TPU-related test files (migrated to the tpu-inference repo).
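
As a rough illustration of the first item, the sketch below shows what removing a flag-guarded fallback can look like. It assumes USE_TPU_INFERENCE behaved like a boolean switch between two implementations. The class and function names are hypothetical stand-ins and are not taken from vLLM or tpu-inference.

```python
class LegacyXlaWorker:
    """Hypothetical stand-in for a removed torch_xla-based class."""

    backend = "torch_xla"


class TpuInferenceWorker:
    """Hypothetical stand-in for a class now provided by tpu-inference."""

    backend = "tpu-inference"


def select_worker_before(use_tpu_inference: bool) -> type:
    # Before the removal (assumed shape): a flag chose between the
    # migrated implementation and the legacy torch_xla fallback.
    return TpuInferenceWorker if use_tpu_inference else LegacyXlaWorker


def select_worker_after() -> type:
    # After the removal: the legacy branch is gone, so only the
    # tpu-inference implementation remains and no flag is needed.
    return TpuInferenceWorker


if __name__ == "__main__":
    print(select_worker_before(False).backend)
    print(select_worker_after().backend)
```

With the fallback gone, call sites no longer need to consult the flag at all, which is the kind of simplification this item describes.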

Pending Removal

Update README.md
Update Dockerfile.tpu

github-actions · 2025-12-16T18:32:36Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs do not trigger a full CI run by default. Instead, only fastcheck CI runs, which covers a small, essential subset of CI tests to catch errors quickly.

You can ask your reviewers to trigger select CI tests on top of fastcheck CI.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either add the ready label to the PR or enable auto-merge.

If you have any questions, please reach out to us on Slack at https://slack.vllm.ai.

🚀

mergify · 2025-12-16T18:33:06Z

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @weiyu0824.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

gemini-code-assist

Code Review

This pull request is a significant refactoring that removes the deprecated torch_xla based TPU backend. The changes are extensive, involving the deletion of numerous files and code blocks related to the old TPU implementation, including tests, model runners, workers, and various utility functions. The new approach mandates the use of the tpu-inference package for TPU support, and the codebase now reflects this by either importing from tpu-inference or failing with a clear error message if the package is not installed. The code removal appears to be clean and thorough, and the new dependency model is explicit and well-defined. The changes are consistent with the stated purpose of the pull request and represent a solid step forward in simplifying the TPU backend support.
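
The "import or fail with a clear error" behavior described in this review could look roughly like the sketch below. It is a minimal illustration, not the code from this PR. The helper name load_tpu_backend is hypothetical, and the import name tpu_inference for the tpu-inference package is an assumption.

```python
import importlib
import importlib.util
from types import ModuleType


def load_tpu_backend(package: str = "tpu_inference") -> ModuleType:
    """Import the TPU backend package or fail with an actionable message.

    `package` defaults to an assumed import name for tpu-inference.
    Only top-level package names are expected here.
    """
    # find_spec returns None when a top-level package is not installed.
    if importlib.util.find_spec(package) is None:
        raise ImportError(
            f"TPU support requires the '{package}' package, which is not "
            "installed. Install tpu-inference to run vLLM on TPU."
        )
    return importlib.import_module(package)


if __name__ == "__main__":
    try:
        load_tpu_backend()
    except ImportError as exc:
        print(f"TPU backend unavailable: {exc}")
```

Checking for the package up front lets the error message name the missing dependency directly instead of surfacing a bare ModuleNotFoundError from deep inside an import chain.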

mergify · 2025-12-16T20:21:06Z

Documentation preview: https://vllm--30808.org.readthedocs.build/en/30808/

mergify · 2025-12-22T22:33:38Z

Hi @weiyu0824, the pre-commit checks have failed. Please run:

uv pip install pre-commit
pre-commit install
pre-commit run --all-files

Then, commit the changes and push to your branch.

For future commits, pre-commit will run automatically on changed files before each commit.

Tip

Is mypy or markdownlint failing?

mypy and markdownlint are run differently in CI. If the failure is related to either of these checks, please use the following commands to run them locally:

# For mypy (substitute "3.10" with the failing version if needed)
pre-commit run --hook-stage manual mypy-3.10
# For markdownlint
pre-commit run --hook-stage manual markdownlint

mergify · 2025-12-22T23:48:45Z

Hi @weiyu0824, the pre-commit checks have failed again. Please rerun the pre-commit commands listed in the earlier comment, then commit the changes and push to your branch.

mergify · 2025-12-23T00:44:15Z

Hi @weiyu0824, the pre-commit checks have failed again. Please rerun the pre-commit commands listed in the earlier comment, then commit the changes and push to your branch.

mergify · 2025-12-23T16:59:14Z

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @weiyu0824.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

weiyu0824 · 2025-12-25T10:42:45Z

Affected by #31044 (comment)

mergify · 2025-12-29T16:02:51Z

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @weiyu0824.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

mergify · 2026-01-06T04:15:45Z

Hi @weiyu0824, the pre-commit checks have failed again. Please rerun the pre-commit commands listed in the earlier comment, then commit the changes and push to your branch.

mergify · 2026-01-07T04:40:27Z

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @weiyu0824.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

mergify bot added the v1 label Dec 16, 2025

mergify bot added the tpu (Related to Google TPUs) and needs-rebase labels Dec 16, 2025

gemini-code-assist bot reviewed Dec 16, 2025

github-project-automation bot added this to gpt-oss Issues & Enhancements Dec 16, 2025

mergify bot added the rocm (Related to AMD ROCm) label Dec 16, 2025

github-project-automation bot added this to NVIDIA Dec 16, 2025

mergify bot added the structured-output label Dec 16, 2025

github-project-automation bot moved this to To Triage in gpt-oss Issues & Enhancements Dec 16, 2025

mergify bot added the speculative-decoding label Dec 16, 2025

github-project-automation bot added this to Structured Output Dec 16, 2025

mergify bot added the tool-calling label Dec 16, 2025

github-project-automation bot added this to Tool Calling Dec 16, 2025

mergify bot added the kv-connector label Dec 16, 2025

weiyu0824 closed this Dec 16, 2025

weiyu0824 force-pushed the task/remove-xla-path-new branch from 7f4717f to f21f5ea Compare December 16, 2025 20:32

weiyu0824 force-pushed the task/remove-xla-path-new branch from 99193f9 to 57a924e Compare December 22, 2025 22:29

weiyu0824 requested a review from ApostaC as a code owner December 22, 2025 23:42

weiyu0824 force-pushed the task/remove-xla-path-new branch from 907769b to c2c433a Compare December 22, 2025 23:44

weiyu0824 force-pushed the task/remove-xla-path-new branch from c2c433a to 48755c1 Compare December 23, 2025 00:40

weiyu0824 force-pushed the task/remove-xla-path-new branch 2 times, most recently from 0539c87 to edb6b94 Compare December 23, 2025 04:08

mergify bot added the needs-rebase label Dec 23, 2025

weiyu0824 force-pushed the task/remove-xla-path-new branch from edb6b94 to 373a222 Compare December 23, 2025 18:09

weiyu0824 and others added 12 commits January 7, 2026 03:30

Remove tpu_inference fall back logic

6a013c4

Signed-off-by: Wei-Yu Lin <weiyulin@google.com>

Remove torch_xla related code path excluding test files

07123d4

Signed-off-by: Wei-Yu Lin <weiyulin@google.com>

Remove tpu-related tests

1831617

Signed-off-by: Wei-Yu Lin <weiyulin@google.com>

Remove tpu_int8 as it is related to deleted quantization config and implementation

5c56ef5

Signed-off-by: Wei-Yu Lin <weiyulin@google.com>

Remove pallas.py as this is migrate to tpu-inference

e8919de

Signed-off-by: Wei-Yu Lin <weiyulin@google.com>

Run pre-commit to format files

73ee4a3

Signed-off-by: Wei-Yu Lin <weiyulin@google.com>

Add TODO and remove unused codepath

19fcc94

Signed-off-by: Wei-Yu Lin <weiyulin@google.com>

Run pre-commit to fix format error

e2383b0

Signed-off-by: Wei-Yu Lin <weiyulin@google.com>

Remove MOE xla implementation

f99abbe

Signed-off-by: Wei-Yu Lin <weiyulin@google.com>

Remove unused pallas registration

748f2b6

Signed-off-by: Wei-Yu Lin <weiyulin@google.com>

Remove _use_pallas var as PALLAS attention backend is deprecated

2d9dc7d

Signed-off-by: Wei-Yu Lin <weiyulin@google.com>

Run pre-commit to fix format error

ccc2ed0

Signed-off-by: Wei-Yu Lin <weiyulin@google.com>

Merge branch 'main' into task/remove-xla-path-new

a866bf5

Signed-off-by: weiyu <62784299+weiyu0824@users.noreply.github.com>

weiyu0824 mentioned this pull request Jan 8, 2026

Address compatibility issues arising from the removal of the XLA dependency vllm-project/tpu-inference#1423

Merged

DarkLight1337 merged 13 commits into vllm-project:main from weiyu0824:task/remove-xla-path-new


3 participants
