UPSTREAM PR #17184: ggml webgpu: add support for emscripten builds by DajanaV · Pull Request #176 · auroralabs-loci/llama.cpp

DajanaV · 2025-11-12T02:11:31Z

This PR builds on and supersedes ggml-org/llama.cpp#15826 from @ngxson.

Adds __EMSCRIPTEN__ preprocessor conditionals where they are necessary for compilation (this included some OS-specific things in common/)
Adds emscripten build flags for 64-bit memory and memory growth, which I found to be required to get the backend operations to pass when running through the browser (Chrome/M3 system).
Also adds a GitHub workflow that ensures emscripten builds compile (at least with test-backend-ops), so that later code additions do not accidentally break them.
Disables dawn/native-only WebGPU features when building for the browser, like experimental support for subgroup matrices.
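
As a rough illustration of the kind of __EMSCRIPTEN__ guard described above, here is a minimal sketch. The helper names native_only_features_enabled and pick_thread_count are hypothetical and are not taken from the PR; the only thing assumed from the toolchain is that emscripten defines the __EMSCRIPTEN__ macro when compiling for the browser.

```cpp
// Sketch only: both helpers are hypothetical names, not functions from llama.cpp.

// Returns false in browser builds, where dawn/native-only features
// (for example experimental subgroup matrices) would be disabled.
constexpr bool native_only_features_enabled() {
#ifdef __EMSCRIPTEN__
    return false;
#else
    return true;
#endif
}

// Picks a worker thread count. Browser builds are forced to a single
// thread; native builds keep the requested count, with a floor of 1.
inline unsigned pick_thread_count(unsigned requested) {
#ifdef __EMSCRIPTEN__
    (void) requested;  // ignored on wasm in this sketch
    return 1;
#else
    return requested == 0 ? 1u : requested;
#endif
}
```

Keeping the conditional inside small helpers like these means call sites stay free of #ifdef blocks, which is one possible way to limit how far the platform checks spread through the code.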

Add fast matrix and matrix/vector multiplication.

* webgpu : fix build on emscripten
* more debugging stuff
* test-backend-ops: force single thread on wasm
* fix single-thread case for init_tensor_uniform
* use jspi
* add pthread
* test: remember to set n_thread for cpu backend
* Add buffer label and enable dawn-specific toggles to turn off some checks
* Intermediate state
* Fast working f16/f32 vec4
* Working float fast mul mat
* Clean up naming of mul_mat to match logical model, start work on q mul_mat
* Setup for subgroup matrix mat mul
* Basic working subgroup matrix
* Working subgroup matrix tiling
* Handle weirder sg matrix sizes (but still % sg matrix size)
* Working start to gemv
* working f16 accumulation with shared memory staging
* Print out available subgroup matrix configurations
* Vectorize dst stores for sg matrix shader
* Gemv working scalar
* Minor set_rows optimization (#4)
  * updated optimization, fixed errors
  * non vectorized version now dispatches one thread per element
  * Simplify
  * Change logic for set_rows pipelines

  ---------

  Co-authored-by: Neha Abbas <nehaabbas@macbookpro.lan>
  Co-authored-by: Neha Abbas <nehaabbas@ReeseLevines-MacBook-Pro.local>
  Co-authored-by: Reese Levine <reeselevine1@gmail.com>
* Comment on dawn toggles
* Working subgroup matrix code for (semi)generic sizes
* Remove some comments
* Cleanup code
* Update dawn version and move to portable subgroup size
* Try to fix new dawn release
* Update subgroup size comment
* Only check for subgroup matrix configs if they are supported
* Add toggles for subgroup matrix/f16 support on nvidia+vulkan
* Make row/col naming consistent
* Refactor shared memory loading
* Move sg matrix stores to correct file
* Working q4_0
* Formatting
* Work with emscripten builds
* Fix test-backend-ops emscripten for f16/quantized types
* Use emscripten memory64 to support get_memory
* Add build flags and try ci

---------

Co-authored-by: Xuan Son Nguyen <son@huggingface.co>

reeselevine and others added 4 commits November 5, 2025 08:24

Faster tensors (#8)

c6bc125

Use map for shader replacements instead of pair of strings

7c2b2ef
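
The commit title above mentions using a map for shader replacements. A minimal sketch of what map-based placeholder substitution could look like is shown here; the function name apply_replacements and the {{KEY}} placeholder syntax are assumptions made for illustration, and the actual ggml WebGPU shader preprocessing may work differently.

```cpp
#include <cstddef>
#include <map>
#include <string>

// Sketch only: apply_replacements and the "{{KEY}}" token format are
// hypothetical. Every occurrence of "{{KEY}}" in src is replaced with
// the value stored under KEY in repls.
inline std::string apply_replacements(std::string src,
                                      const std::map<std::string, std::string>& repls) {
    for (const auto& kv : repls) {
        const std::string token = "{{" + kv.first + "}}";
        std::size_t pos = 0;
        while ((pos = src.find(token, pos)) != std::string::npos) {
            src.replace(pos, token.size(), kv.second);
            pos += kv.second.size();  // skip past the inserted text
        }
    }
    return src;
}
```

Compared with a pair of strings, a map lets one shader template carry any number of named placeholders, and each call site only has to list the keys it cares about.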

Merge remote-tracking branch 'upstream/master'

6db7298

DajanaV force-pushed the main branch 26 times, most recently from 29827de to a802168, on November 15, 2025 11:06

loci-dev force-pushed the main branch 25 times, most recently from 048ad94 to 6c1fde6, on February 3, 2026 13:32

loci-dev force-pushed the main branch 5 times, most recently from f998d1f to 30ef9d0, on February 16, 2026 02:17

DajanaV wants to merge 4 commits into main from upstream-PR17184-branch_reeselevine-master
