Skip to content

UPSTREAM PR #18610: ggml webgpu: initial flashattention implementation#821

Open
loci-dev wants to merge 3 commits into
mainfrom
upstream-PR18610-branch_reeselevine-master
Open

UPSTREAM PR #18610: ggml webgpu: initial flashattention implementation#821
loci-dev wants to merge 3 commits into
mainfrom
upstream-PR18610-branch_reeselevine-master

formatting shader

e5bf2d5
Select commit
Loading
Failed to load commit list.
LOCI Review / Performance Review #821 succeeded Jan 5, 2026

Performance unchanged

0 binaries improved · 0 binaries unchanged · 0 binaries stable ~ within threshold · 0 binaries degraded ~ beyond threshold

Binary Δ % Response Δ % Throughput Performance (based on response time)

Performance threshold: 30%
Default configuration used.
Note: Performance status is evaluated only from Δ% Response. Throughput is displayed for reference.

Explore the complete analysis inside the Version Insights.
Open the Pull Request linked to this check-run.