Temporarily disable prefix caching on Metal by Kingwl · Pull Request #187 · vllm-project/vllm-metal

Kingwl · 2026-03-21T06:45:44Z

This PR temporarily disables vLLM core prefix caching on Metal when paged attention is enabled.

Per the comments on #185.
Fixes #184

Signed-off-by: kingwl <kingwenlu@gmail.com>

WindChimeRan

Two non-blocking questions:

1. For dev purposes, should we add an env var override (e.g. VLLM_METAL_FORCE_PREFIX_CACHING=1) so we can still test the prefix caching path while it's disabled by default? We'll need to develop that path soon. (This is an open question; maybe you have a better idea.)
2. The closed #185 had some bookkeeping fixes that look independent of the prefix caching issue. Would you consider splitting those out into a separate PR? I haven't verified them in detail yet, but they seem like useful prep work.
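
For illustration, the override floated in the first question might look roughly like the sketch below. This is not code from the PR: the helper `resolve_prefix_caching` and its parameters are hypothetical, the variable name is only the example given in the comment, and the sketch assumes the PR leaves the user's setting untouched when paged attention is off.

```python
import os

# Example name from the review comment; assumed here, not an existing vLLM setting.
FORCE_ENV = "VLLM_METAL_FORCE_PREFIX_CACHING"


def resolve_prefix_caching(requested, paged_attention, env=None):
    """Decide whether prefix caching stays on for the Metal backend.

    requested:       what the user asked for (True/False).
    paged_attention: whether the paged attention path is enabled.
    env:             mapping to read the override from (defaults to os.environ).
    """
    env = os.environ if env is None else env

    # Assumption: only the paged attention path is affected by the PR,
    # so the requested value passes through unchanged otherwise.
    if not paged_attention:
        return requested

    # Developer override: keep the requested value so the prefix caching
    # path can still be exercised while it is disabled by default.
    if env.get(FORCE_ENV, "0") == "1":
        return requested

    # Default behaviour described in the PR: disable on this path.
    return False
```

Passing `env` explicitly keeps the helper easy to unit-test; with the default, a developer would launch with `VLLM_METAL_FORCE_PREFIX_CACHING=1` set in the shell.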

Kingwl · 2026-03-21T16:27:49Z

Sure.

Temporarily disable prefix caching on Metal

d923672

Signed-off-by: kingwl <kingwenlu@gmail.com>

WindChimeRan approved these changes Mar 21, 2026


WindChimeRan merged commit c8d2715 into vllm-project:main Mar 21, 2026
5 checks passed

WindChimeRan mentioned this pull request Mar 21, 2026

Support partitioned Metal attention #181

Merged

Kingwl mentioned this pull request Mar 21, 2026

Fix unified prefill chunk bookkeeping #192

Closed

WindChimeRan merged 1 commit into vllm-project:main from Kingwl:fix/prefix-cache-engine-crash-disable-prefix-cache
