[CPU] Support head sizes 80 and 112 with vec16 fallback#251

Merged
moulalis merged 1 commit into main from cpu-atten-main on Jan 15, 2026

Conversation

@Meghagaur
Contributor

Purpose
Reintroduce support for head dimensions 80 and 112 in the CPU attention backend. These were removed in vllm-project/vllm#27954, but they are commonly used by Granite models deployed on Z architectures. Since these head sizes are not friendly to the Intel AMX instruction set, the implementation now falls back to the vec16 path for them.
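A minimal sketch of the dispatch idea described above, assuming the backend picks a vector width per head size. The names (`VecWidth`, `select_vec_width`) are illustrative, not the actual vLLM symbols:

```cpp
// Hypothetical dispatch sketch: head sizes that are multiples of 32 can take
// the wider AMX-friendly path, while sizes like 80 and 112 (multiples of 16
// but not of 32) fall back to a 16-lane vector path.
enum class VecWidth { kVec32, kVec16, kUnsupported };

VecWidth select_vec_width(int head_size) {
  if (head_size % 32 == 0) return VecWidth::kVec32;  // AMX-friendly sizes
  if (head_size % 16 == 0) return VecWidth::kVec16;  // e.g. 80, 112
  return VecWidth::kUnsupported;                     // reject everything else
}
```

Under this sketch, 80 and 112 map to the vec16 fallback while 64 and 128 keep the wide path; the actual kernel selection in the backend may differ.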

Test Plan
Build the Docker image and test using the ibm-granite/granite-3b-code-base-2k model, which has a head size of 80.
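For reference, the head size follows from the model config as hidden size divided by attention head count. The concrete values below are assumptions about the granite-3b-code-base-2k config, not taken from this PR:

```python
# Assumed config values for ibm-granite/granite-3b-code-base-2k (hypothetical,
# check the model's config.json): head_size = hidden_size // num_heads.
hidden_size = 2560
num_attention_heads = 32
head_size = hidden_size // num_attention_heads
print(head_size)  # 80, not a multiple of 32, hence the vec16 fallback
```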

upstream PR - vllm-project/vllm#31968

@Meghagaur Meghagaur changed the title cpu-atten-fix [CPU] Support head sizes 80 and 112 with vec16 fallback Jan 14, 2026
@Meghagaur
Contributor Author

/build-konflux

@Meghagaur
Contributor Author

Meghagaur commented Jan 15, 2026

Hi @wznoinsk
Could you please review this PR? The Konflux build has passed.
This is required for vLLM CPU to work correctly on RHOAI 3.2.
Thank you

@Meghagaur Meghagaur requested a review from wznoinsk January 15, 2026 04:43
@Meghagaur Meghagaur mentioned this pull request Jan 15, 2026
@moulalis moulalis merged commit cf334b6 into main Jan 15, 2026
2 checks passed