
✨ Use shared CachedRequestData as vllm:main #273

Closed
prashantgupta24 wants to merge 11 commits into main from fix-upstream

Conversation

@prashantgupta24
Collaborator

@prashantgupta24 prashantgupta24 commented Jul 1, 2025

Description

This is more complicated than I thought originally :)

Alright, all tests are passing locally. There seems to be another breaking change in vllm:main that will have to be addressed to make the main tests pass. The default tests fail because this code is not backward compatible yet 😅
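To make the compatibility problem concrete, here is a minimal sketch of handling both the old per-request shape and the new shared, batched `CachedRequestData` from vllm:main (where `req_ids` is now a list). The class and field names below are illustrative stand-ins, not the exact upstream definitions.

```python
from dataclasses import dataclass

# Hypothetical stand-in for the new shared CachedRequestData in vllm:main;
# field names are illustrative, not the exact upstream definition.
@dataclass
class CachedRequestData:
    req_ids: list        # one id per request (was a single id per object before)
    new_token_ids: list  # one list of new tokens per request

def iter_cached_requests(cached_reqs):
    """Yield (req_id, new_tokens) pairs from either layout:
    the new shared/batched shape (req_ids is a list) or the old
    per-request shape (a scalar req_id attribute)."""
    req_ids = getattr(cached_reqs, "req_ids", None)
    if req_ids is not None:
        # New layout: one shared object covering the whole batch.
        yield from zip(req_ids, cached_reqs.new_token_ids)
    else:
        # Old layout: one object per request.
        yield cached_reqs.req_id, cached_reqs.new_token_ids

batch = CachedRequestData(req_ids=["r1", "r2"], new_token_ids=[[5], [7, 8]])
pairs = list(iter_cached_requests(batch))
```

A shim like this is one way a plugin can stay backward compatible while upstream settles; the actual fix in this PR may differ.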

Related Issues

fix #271

Signed-off-by: Prashant Gupta <prashantgupta@us.ibm.com>
@github-actions

github-actions bot commented Jul 1, 2025

👋 Hi! Thank you for contributing to vLLM support on Spyre.
Just a reminder: Make sure that your code passes all the linting checks, otherwise your PR won't be able to be merged. To do so, first install the linting requirements, then run format.sh and commit the changes. This can be done with uv directly:

uv sync --frozen --group lint --active --inexact

Or this can be done with pip:

uv pip compile --group lint > requirements-lint.txt
pip install -r requirements-lint.txt
bash format.sh

Now you are good to go 🚀

Signed-off-by: Prashant Gupta <prashantgupta@us.ibm.com>
Signed-off-by: Prashant Gupta <prashantgupta@us.ibm.com>
@prashantgupta24 prashantgupta24 changed the title 🐛 req_ids is now a list in vllm:main 🐛 Use shared CachedRequestData as vllm:main Jul 1, 2025
Signed-off-by: Prashant Gupta <prashantgupta@us.ibm.com>
Signed-off-by: Prashant Gupta <prashantgupta@us.ibm.com>
@prashantgupta24 prashantgupta24 changed the title 🐛 Use shared CachedRequestData as vllm:main ✨ Use shared CachedRequestData as vllm:main Jul 2, 2025
Signed-off-by: Prashant Gupta <prashantgupta@us.ibm.com>
Signed-off-by: Prashant Gupta <prashantgupta@us.ibm.com>
Comment thread vllm_spyre/v1/worker/spyre_worker.py
Signed-off-by: Prashant Gupta <prashantgupta@us.ibm.com>
Signed-off-by: Prashant Gupta <prashantgupta@us.ibm.com>
Signed-off-by: Prashant Gupta <prashantgupta@us.ibm.com>
@maxdebayser
Collaborator

@prashantgupta24 , I think the other breaking change that you mentioned is the sampling metadata one, right? I've opened a hacky PR to temporarily fix this: #278

@prashantgupta24
Collaborator Author

@prashantgupta24 , I think the other breaking change that you mentioned is the sampling metadata one, right? I've opened a hacky PR to temporarily fix this: #278

Yep, thanks!

Signed-off-by: Prashant Gupta <prashantgupta@us.ibm.com>
@prashantgupta24 prashantgupta24 mentioned this pull request Jul 4, 2025
@prashantgupta24
Copy link
Copy Markdown
Collaborator Author

closing in favor of #283

joerunde pushed a commit that referenced this pull request Jul 8, 2025
# Description

This branch has a fix for:
- Caching the token_ids: new tokens are now cached in `execute_model` instead of `update_states`, because of vllm-project/vllm#20291.
- Changes from the shared `CachedRequestData` (#273)

## Related Issues

Fix for #271

---------

Signed-off-by: Prashant Gupta <prashantgupta@us.ibm.com>
Signed-off-by: Max de Bayser <mbayser@br.ibm.com>
Co-authored-by: Max de Bayser <mbayser@br.ibm.com>
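The first fix in the commit message can be sketched roughly as follows. This is an illustrative mock, not the actual spyre_model_runner code; the class and method bodies are assumptions showing only where the caching moves.

```python
# Illustrative sketch of moving token caching out of update_states and
# into execute_model, as the commit message describes. Names and
# signatures are hypothetical, not the real vllm-spyre model runner.
class ModelRunnerSketch:
    def __init__(self):
        self.cached_token_ids = {}  # req_id -> all token ids seen so far

    def update_states(self, scheduler_output):
        # After vllm-project/vllm#20291, the new tokens are no longer
        # available at this point, so no token caching happens here.
        pass

    def execute_model(self, req_ids, sampled_token_ids):
        # Cache newly sampled tokens where they are actually produced.
        for req_id, token_id in zip(req_ids, sampled_token_ids):
            self.cached_token_ids.setdefault(req_id, []).append(token_id)
        return sampled_token_ids

runner = ModelRunnerSketch()
runner.execute_model(["r1", "r2"], [11, 22])
runner.execute_model(["r1"], [12])
```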
@prashantgupta24 prashantgupta24 deleted the fix-upstream branch March 5, 2026 17:01
rafvasq pushed a commit to rafvasq/sendnn-inference that referenced this pull request Mar 11, 2026


Development

Successfully merging this pull request may close these issues.

Fix plugin code against vLLM main

2 participants