UPSTREAM PR #18610: ggml webgpu: initial flashattention implementation #821
LOCI Review / Performance Review #821
succeeded
Jan 5, 2026
Performance unchanged
0 binaries improved · 0 binaries unchanged · 0 binaries stable ~ within threshold · 0 binaries degraded ~ beyond threshold
| Binary | Δ % Response | Δ % Throughput | Performance (based on response time) |
|---|
Performance threshold: 30%
Default configuration used.
Note: Performance status is evaluated only from Δ% Response. Throughput is displayed for reference.
Explore the complete analysis inside the Version Insights.
Open the Pull Request linked to this check-run.
Loading