chore: add support for v0.18.0 by rafvasq · Pull Request #857 · torch-spyre/sendnn-inference

rafvasq · 2026-03-23T14:22:29Z

Description

Bump versions to v0.18.0
Add a shim to keep running on PyTorch 2.7.1
Remove the deprecated --swap-space parameter (see "[V0 Deprecation] Remove unused swap_space parameter", vllm-project/vllm#36216)
Set new field cache_config.user_specified_block_size = True to avoid block size overrides.

Related Issues

Unrelated doc failure is fixed in [Docs] Fix build #860
Closes [Feature]: Add support for v0.18.0 #858

Checklist

I have read the contributing guidelines
My code follows the project's code style (run bash format.sh)
I have added tests for my changes (if applicable)
I have updated the documentation (if applicable)
My commits include a Signed-off-by: line (DCO compliance)

github-actions · 2026-03-23T14:22:40Z

👋 Hi! Thank you for contributing to vLLM support on Spyre.
Just a reminder: Make sure that your code passes all the linting checks, otherwise your PR won't be able to be merged. To do so, run ./format.sh.
Now you are good to go 🚀.

We also recommend installing prek and configuring it to check your code before every local commit.

Signed-off-by: Rafael Vasquez <rafvasq21@gmail.com>

Co-authored-by: Max de Bayser <mbayser@br.ibm.com>
Co-authored-by: Rafael Vasquez <rafvasq21@gmail.com>
Signed-off-by: Rafael Vasquez <rafvasq21@gmail.com>

Signed-off-by: Rafael Vasquez <rafvasq21@gmail.com>

CacheConfig now has a `user_specified_block_size` field, probably introduced to track whether the user changed the block size. In vllm-spyre's platform.py we're not technically the user, but 64 is the only valid block size on Spyre anyway, and for some reason setting block_size directly no longer works because it gets overwritten with the default of 16.

Signed-off-by: Max de Bayser <maxdebayser@gmail.com>
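To illustrate the behavior described in this commit message, here is a minimal sketch. It does not import vLLM: `CacheConfigStandIn` and `apply_default_block_size` are hypothetical stand-ins that only model the two fields and the override as the message describes them, and the real logic in vLLM may differ.

```python
from dataclasses import dataclass

SPYRE_BLOCK_SIZE = 64            # the only valid block size on Spyre, per the commit message
ASSUMED_DEFAULT_BLOCK_SIZE = 16  # the default the commit message says block_size is reset to


@dataclass
class CacheConfigStandIn:
    """Hypothetical stand-in for vLLM's CacheConfig; only two fields are modelled."""

    block_size: int = ASSUMED_DEFAULT_BLOCK_SIZE
    user_specified_block_size: bool = False


def apply_default_block_size(cfg: CacheConfigStandIn) -> None:
    """Assumed model of the override: reset block_size unless it is flagged as user-specified."""
    if not cfg.user_specified_block_size:
        cfg.block_size = ASSUMED_DEFAULT_BLOCK_SIZE


def pin_spyre_block_size(cfg: CacheConfigStandIn) -> None:
    """Set the Spyre block size and mark it as user-specified so it survives the override."""
    cfg.block_size = SPYRE_BLOCK_SIZE
    cfg.user_specified_block_size = True
```

Under these assumptions, setting only `block_size = 64` is undone by the override step, while setting the flag as well keeps the value at 64.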

Signed-off-by: Max de Bayser <maxdebayser@gmail.com>

maxdebayser · 2026-03-24T21:02:30Z

bot:test

tjohnson31415 · 2026-03-25T03:53:28Z

bot:test
MARKERS="spyre and prefix_caching and not quantized"
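The MARKERS value reads like a pytest marker expression. Assuming the bot forwards it to pytest's `-m` option (the thread does not confirm this), the selection logic can be sketched as below. `marker_expression_matches` is a hypothetical helper for illustration, not part of pytest or this repository.

```python
import re

_KEYWORDS = {"and", "or", "not"}


def marker_expression_matches(expression: str, test_markers: set[str]) -> bool:
    """Evaluate an and/or/not marker expression against the markers on one test."""
    # Accept only identifiers, whitespace and parentheses before evaluating.
    if not re.fullmatch(r"[\w\s()]+", expression):
        raise ValueError(f"unsupported characters in expression: {expression!r}")

    def substitute(match: re.Match) -> str:
        word = match.group(0)
        # Keep boolean operators; replace each marker name with True or False.
        return word if word in _KEYWORDS else str(word in test_markers)

    python_expr = re.sub(r"[A-Za-z_]\w*", substitute, expression)
    # The expression now contains only True/False, operators and parentheses.
    return bool(eval(python_expr, {"__builtins__": {}}, {}))
```

With this sketch, a test marked `spyre` and `prefix_caching` would be selected by the expression above, and a test that also carries `quantized` would not.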

In 2.10, empty_cache() is available but throws an error because in vllm-spyre we don't allocate any accelerator. Since we will do so in spyre-next, I think it's better to add a check before calling empty_cache() instead of just replacing the whole thing with a no-op.

Signed-off-by: Max de Bayser <mbayser@br.ibm.com>
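The "check before calling" idea in this commit message can be sketched as a small guard. The two callables are hypothetical stand-ins for whatever availability check and cache-clearing function the platform exposes; no specific PyTorch API is assumed here. Note that a later commit in this PR ("just use noop for all cases") went with the plain no-op instead.

```python
from typing import Callable


def guarded_empty_cache(
    accelerator_available: Callable[[], bool],
    empty_cache: Callable[[], None],
) -> bool:
    """Call empty_cache() only when an accelerator has been allocated.

    Returns True if the cache was cleared and False if the call was skipped.
    """
    if not accelerator_available():
        # Nothing was allocated, so calling empty_cache() could raise; skip it.
        return False
    empty_cache()
    return True
```

Compared with an unconditional no-op, a guard like this would start clearing the cache as soon as an accelerator is actually allocated.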

maxdebayser · 2026-03-25T12:42:40Z

bot:test
MARKERS="spyre and prefix_caching and not quantized"

maxdebayser · 2026-03-25T13:55:37Z

@rafvasq, @tjohnson31415: except for the readthedocs check, all tests are passing. Since this PR doesn't touch the docs, I don't think it's a blocker.

rafvasq · 2026-03-25T14:15:05Z

Thanks @maxdebayser, yes the unrelated doc failure is fixed in #860.

Signed-off-by: Max de Bayser <mbayser@br.ibm.com>

maxdebayser · 2026-03-25T15:52:35Z

bot:test
MARKERS="spyre and prefix_caching and not quantized"

maxdebayser · 2026-03-25T16:59:17Z

bot:test
MARKERS="spyre and prefix_caching and not quantized"

Signed-off-by: Max de Bayser <mbayser@br.ibm.com>

tjohnson31415

LGTM

chore: add support for v0.18.0

98e417b

Signed-off-by: Rafael Vasquez <rafvasq21@gmail.com>

rafvasq force-pushed the support-v0.18.0 branch from 6fb6d7e to 98e417b on March 23, 2026 15:40

Add empty_cache workaround

5e3cabf

Co-authored-by: Max de Bayser <mbayser@br.ibm.com>
Co-authored-by: Rafael Vasquez <rafvasq21@gmail.com>
Signed-off-by: Rafael Vasquez <rafvasq21@gmail.com>

rafvasq force-pushed the support-v0.18.0 branch from 77179e2 to 5e3cabf on March 23, 2026 17:18

rafvasq and others added 3 commits March 23, 2026 13:27

Rm deprecated swap_space arg

b0929d6

Signed-off-by: Rafael Vasquez <rafvasq21@gmail.com>

Merge branch 'vllm-project:main' into support-v0.18.0

3c7406e

tjohnson31415 mentioned this pull request Mar 24, 2026

chore (1.x): support vllm 0.18.0 without backwards compat #861

Merged

5 tasks

maxdebayser added 2 commits March 24, 2026 16:38

remove debug print (oops)

eb0609b

Signed-off-by: Max de Bayser <maxdebayser@gmail.com>

fix type error

fb2c1f4

Signed-off-by: Max de Bayser <maxdebayser@gmail.com>

maxdebayser marked this pull request as ready for review March 25, 2026 13:02

maxdebayser requested review from joerunde, nikolaospapandreou, prashantgupta24, sducouedic, tdoublep and yannicks1 as code owners March 25, 2026 13:02

rafvasq enabled auto-merge (squash) March 25, 2026 14:18

rafvasq disabled auto-merge March 25, 2026 14:18

rafvasq requested a review from tjohnson31415 March 25, 2026 14:18

tjohnson31415 reviewed Mar 25, 2026

Comment thread vllm_spyre/platform.py Outdated

github-actions Bot added the "ready" label (Runs the full CI test suite. Only add to PRs once ready to merge to limit public GHA usage) Mar 25, 2026

just use noop for all cases

8e3031e

Signed-off-by: Max de Bayser <mbayser@br.ibm.com>

tjohnson31415 reviewed Mar 25, 2026

Comment thread vllm_spyre/platform.py Outdated

remove backward compatibility check

9edee06

Signed-off-by: Max de Bayser <mbayser@br.ibm.com>

tjohnson31415 approved these changes Mar 25, 2026

tjohnson31415 merged commit 2d816a7 into torch-spyre:main Mar 25, 2026
13 of 14 checks passed

rafvasq deleted the support-v0.18.0 branch March 25, 2026 18:56

tjohnson31415 merged 10 commits into torch-spyre:main from rafvasq:support-v0.18.0
