Skip to content

πŸ› Fix tiered-prefix-cache CrashLoopBackOff: num_cpu_blocks β†’ cpu_bytes_to_use#768

Merged
clubanderson merged 1 commit intomainfrom
fix/tpc-cpu-bytes-to-use
Feb 14, 2026
Merged

πŸ› Fix tiered-prefix-cache CrashLoopBackOff: num_cpu_blocks β†’ cpu_bytes_to_use#768
clubanderson merged 1 commit intomainfrom
fix/tpc-cpu-bytes-to-use

Conversation

@clubanderson
Copy link
Copy Markdown
Contributor

Summary

  • Fixes the nightly E2E tiered-prefix-cache/cpu workflow CrashLoopBackOff on OpenShift
  • vLLM v0.14.1 (PR vllm-project/vllm#24498, merged Jan 12 2026) replaced num_cpu_blocks with cpu_bytes_to_use in the OffloadingConnector's CPUOffloadingSpec
  • The guide manifest and nightly workflow still used the old num_cpu_blocks parameter, causing: Exception: cpu_bytes_to_use must be specified in kv_connector_extra_config

Changes

  • Guide manifest (offloading-connector/kustomization.yaml): num_cpu_blocks: 41000 β†’ cpu_bytes_to_use: 107374182400 (100GB)
  • Nightly slim patch (nightly-e2e-tiered-prefix-cache.yaml): num_cpu_blocks: 4000 β†’ cpu_bytes_to_use: 10737418240 (10GB)
  • Benchmark docs (README.md): Updated parameter reference

Test plan

  • Nightly E2E tiered-prefix-cache workflow passes on OpenShift (no more CrashLoopBackOff)
  • Manual kubectl apply -k guides/tiered-prefix-cache/cpu/manifests/vllm/offloading-connector deploys successfully

…s_to_use

vLLM v0.14.1 (PR #24498, merged Jan 12 2026) replaced the
`num_cpu_blocks` config key with `cpu_bytes_to_use` in the
OffloadingConnector's CPUOffloadingSpec. This causes model server
pods to crash immediately with:

  Exception: cpu_bytes_to_use must be specified in kv_connector_extra_config

Update all references:
- Guide manifest: 41000 blocks (~100GB) β†’ 107374182400 bytes (100GB)
- Nightly slim patch: 4000 blocks (~10GB) β†’ 10737418240 bytes (10GB)
- README benchmark section: updated parameter name

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Signed-off-by: Andrew Anderson <andy@clubanderson.com>
@clubanderson clubanderson merged commit 7e80a18 into main Feb 14, 2026
23 of 24 checks passed
@clubanderson
Copy link
Copy Markdown
Contributor Author

@Gregory-Pereira ptal!

@clubanderson
Copy link
Copy Markdown
Contributor Author

this was reverted - will open new pr

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant