[BUG] Reset running requests when clearing cache for pause/resume #34382

Merged
robertgshaw2-redhat merged 1 commit into vllm-project:main from hao-aaron:keep-mode-fix
Feb 12, 2026
@hao-aaron hao-aaron commented Feb 11, 2026

Purpose

Addresses #32351 (comment): in-progress requests block clearing the prefix cache in keep mode.

Test Plan

Test Result


Signed-off-by: hao-aaron <ahao@anyscale.com>
@hao-aaron hao-aaron marked this pull request as ready for review February 11, 2026 22:35
@mergify mergify bot added the v1 and bug labels Feb 11, 2026
@gemini-code-assist gemini-code-assist bot left a comment

Code Review

This pull request addresses a bug where in-progress requests would block clearing the prefix cache when pause_generation is called with mode="keep". The fix correctly passes reset_running_requests=True to reset_prefix_cache, ensuring that the state of running requests is properly handled when the cache is cleared. This change is safe for other modes (abort and wait) as there are no running requests in those scenarios. The fix is correct and well-contained.
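
To illustrate the behavior described above, here is a minimal toy sketch in Python. It is not vLLM code: `ToyScheduler` and `ToyRequest` are hypothetical names, and only the `reset_prefix_cache` name and its `reset_running_requests` flag come from this PR. The assumption modeled here is that a cache reset cannot succeed while running requests still hold cached blocks, and that setting the flag first releases those blocks and requeues the requests.

```python
from dataclasses import dataclass, field


@dataclass
class ToyRequest:
    """Hypothetical request record; not a vLLM type."""
    request_id: str
    num_cached_blocks: int  # cache blocks this request currently holds


@dataclass
class ToyScheduler:
    """Toy stand-in for a scheduler that owns a prefix cache (assumption)."""
    running: list[ToyRequest] = field(default_factory=list)
    waiting: list[ToyRequest] = field(default_factory=list)

    def reset_prefix_cache(self, reset_running_requests: bool = False) -> bool:
        """Return True if the cache could be cleared, False otherwise."""
        if reset_running_requests:
            # Release the blocks held by in-progress requests and put the
            # requests back at the front of the waiting queue.
            for req in self.running:
                req.num_cached_blocks = 0
            self.waiting = self.running + self.waiting
            self.running = []
        # In this toy model the reset only succeeds when no running
        # request still holds cached blocks.
        blocks_in_use = sum(req.num_cached_blocks for req in self.running)
        return blocks_in_use == 0


# "keep" mode: one request is still in progress when the cache is cleared.
scheduler = ToyScheduler(running=[ToyRequest("req-0", num_cached_blocks=3)])
blocked = scheduler.reset_prefix_cache()
cleared = scheduler.reset_prefix_cache(reset_running_requests=True)
```

In this sketch the first call is blocked by the in-progress request, while the second call succeeds and leaves the request queued so it can resume later. With no running requests, as in the abort and wait modes, both calls behave the same.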

@robertgshaw2-redhat robertgshaw2-redhat enabled auto-merge (squash) February 11, 2026 23:00
@github-actions github-actions bot added the ready label Feb 11, 2026
@njhill njhill left a comment

We should check the implications for the MM caches too.

@robertgshaw2-redhat robertgshaw2-redhat merged commit 7b5a8b4 into vllm-project:main Feb 12, 2026
47 of 48 checks passed
eldarkurtic pushed a commit to eldarkurtic/vllm that referenced this pull request Feb 19, 2026
…lm-project#34382)

Signed-off-by: hao-aaron <ahao@anyscale.com>
Signed-off-by: Eldar Kurtic <research@neuralmagic.com>
llsj14 pushed a commit to llsj14/vllm that referenced this pull request Mar 1, 2026
tunglinwood pushed a commit to tunglinwood/vllm that referenced this pull request Mar 4, 2026
Labels

bug, ready, v1

3 participants