[cpu][bench] Add CPU paged attention benchmarks by fadara01 · Pull Request #31720 · vllm-project/vllm

fadara01 · 2026-01-05T12:28:17Z

Purpose

Add CPU paged attention benchmarks
Fixes: #30374

Test Plan

Test Result

Essential Elements of an Effective PR Description Checklist

[ Y] The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
The test plan, such as providing test command.
The test results, such as pasting the results comparison before and after, or e2e results
(Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
(Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

fadara01 · 2026-01-05T12:29:48Z

@bigPYJ1151 could you please review?

gemini-code-assist

Code Review

This pull request introduces a benchmark script for CPU paged attention, which is a valuable addition for performance testing and optimization. The script is well-structured and provides a good range of configurable parameters. My review focuses on improving the robustness of the main benchmark function to prevent potential runtime errors if it's used in different contexts.

benchmarks/kernels/cpu/benchmark_cpu_attn.py

Fixes: vllm-project#30374 Signed-off-by: Fadi Arafeh <fadi.arafeh@arm.com>

bigPYJ1151

Thanks!

Signed-off-by: Fadi Arafeh <fadi.arafeh@arm.com>

Signed-off-by: Fadi Arafeh <fadi.arafeh@arm.com> Signed-off-by: dsuhinin <suhinin.dmitriy@gmail.com>

Signed-off-by: Fadi Arafeh <fadi.arafeh@arm.com>

mergify bot added the performance Performance-related issues label Jan 5, 2026

mergify bot assigned fadara01 Jan 5, 2026

mergify bot added the cpu Related to CPU backends label Jan 5, 2026

gemini-code-assist bot reviewed Jan 5, 2026

View reviewed changes

benchmarks/kernels/cpu/benchmark_cpu_attn.py Outdated Show resolved Hide resolved

[cpu][bench] Add CPU paged attention benchmarks

e12096e

Fixes: vllm-project#30374 Signed-off-by: Fadi Arafeh <fadi.arafeh@arm.com>

fadara01 force-pushed the cpu_attn_benchmark branch from b025d3d to e12096e Compare January 5, 2026 12:33

fadara01 mentioned this pull request Jan 5, 2026

[Feature]: Fused MoE Micro Benchmark for CPU Backend #31721

Closed

1 task

bigPYJ1151 approved these changes Jan 6, 2026

View reviewed changes

bigPYJ1151 enabled auto-merge (squash) January 6, 2026 08:22

github-actions bot added the ready ONLY add when PR is ready to merge/full CI is needed label Jan 6, 2026

Merge branch 'main' into cpu_attn_benchmark

5f76b68

bigPYJ1151 merged commit 799b572 into vllm-project:main Jan 6, 2026
17 checks passed

LucasWilkinson pushed a commit to neuralmagic/vllm that referenced this pull request Jan 6, 2026

[cpu][bench] Add CPU paged attention benchmarks (vllm-project#31720)

7d4459f

Signed-off-by: Fadi Arafeh <fadi.arafeh@arm.com>

yugong333 pushed a commit to yugong333/vllm that referenced this pull request Jan 9, 2026

[cpu][bench] Add CPU paged attention benchmarks (vllm-project#31720)

fbffb26

Signed-off-by: Fadi Arafeh <fadi.arafeh@arm.com>

gassan-arm mentioned this pull request Jan 13, 2026

[cpu][performance] CPU Paged Attention NEON BFMMLA BF16 Implementation #32263

Merged

akh64bit pushed a commit to akh64bit/vllm that referenced this pull request Jan 16, 2026

[cpu][bench] Add CPU paged attention benchmarks (vllm-project#31720)

e986777

Signed-off-by: Fadi Arafeh <fadi.arafeh@arm.com>

dsuhinin pushed a commit to dsuhinin/vllm that referenced this pull request Jan 21, 2026

[cpu][bench] Add CPU paged attention benchmarks (vllm-project#31720)

4576bfe

Signed-off-by: Fadi Arafeh <fadi.arafeh@arm.com> Signed-off-by: dsuhinin <suhinin.dmitriy@gmail.com>

ItzDEXX pushed a commit to ItzDEXX/vllm that referenced this pull request Feb 19, 2026

[cpu][bench] Add CPU paged attention benchmarks (vllm-project#31720)

d20d06f

Signed-off-by: Fadi Arafeh <fadi.arafeh@arm.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[cpu][bench] Add CPU paged attention benchmarks#31720

[cpu][bench] Add CPU paged attention benchmarks#31720
bigPYJ1151 merged 2 commits intovllm-project:mainfrom
fadara01:cpu_attn_benchmark

fadara01 commented Jan 5, 2026 •

edited by github-actions bot

Loading

Uh oh!

fadara01 commented Jan 5, 2026

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

bigPYJ1151 left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

fadara01 commented Jan 5, 2026 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Test Plan

Test Result

Uh oh!

fadara01 commented Jan 5, 2026

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

bigPYJ1151 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

fadara01 commented Jan 5, 2026 •

edited by github-actions bot

Loading