Skip to content

ggml-webgpu: FlashAttention refactor + standardize quantization support#23834

Merged
ggerganov merged 9 commits into
ggml-org:masterfrom
reeselevine:flash_attn_refactor
Jun 4, 2026
Merged

ggml-webgpu: FlashAttention refactor + standardize quantization support#23834
ggerganov merged 9 commits into
ggml-org:masterfrom
reeselevine:flash_attn_refactor

Commits

Commits on May 27, 2026

Commits on May 28, 2026

Commits on Jun 2, 2026