UPSTREAM PR #19204: ggml-virtgpu: make the code thread safe #1094
Conversation
not necessary
The static init isn't thread safe.
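To illustrate the concern about static initialization, here is a minimal hypothetical sketch (not the backend's actual code): a lazily initialized static guarded only by a plain flag races when two threads call it concurrently, while `std::call_once` makes the one-time query safe.

```cpp
#include <mutex>

struct caps { int max_buffers = 0; };

// Stand-in for an expensive one-time query to the host; assumed for illustration.
static caps query_caps_from_host() {
    caps c;
    c.max_buffers = 8;
    return c;
}

static caps           g_caps;
static bool           g_caps_ready = false;   // plain flag: no synchronization
static std::once_flag g_caps_once;

// Racy version: two threads can both observe g_caps_ready == false and
// initialize g_caps concurrently (a data race).
const caps & get_caps_racy() {
    if (!g_caps_ready) {
        g_caps = query_caps_from_host();
        g_caps_ready = true;
    }
    return g_caps;
}

// Thread-safe version: std::call_once runs the query exactly once, and every
// caller sees the fully initialized value.
const caps & get_caps_safe() {
    std::call_once(g_caps_once, [] { g_caps = query_caps_from_host(); });
    return g_caps;
}
```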
No meaningful performance changes were detected across 115126 analyzed functions in the following binaries: build.bin.llama-tts, build.bin.llama-cvector-generator, build.bin.libllama.so, build.bin.libmtmd.so, build.bin.llama-tokenize, build.bin.llama-quantize, build.bin.llama-qwen2vl-cli, build.bin.libggml-base.so, build.bin.libggml-cpu.so, build.bin.libggml.so, build.bin.llama-bench, build.bin.llama-gemma3-cli, build.bin.llama-gguf-split, build.bin.llama-llava-cli, build.bin.llama-minicpmv-cli. 🔎 Full breakdown: Loci Inspector.
Force-pushed from 6fab7f9 to bbab1c9
Force-pushed from 4d805ce to 7ff3e7f
No meaningful performance changes were detected across 115126 analyzed functions in the following binaries: build.bin.llama-tts, build.bin.llama-cvector-generator, build.bin.libllama.so, build.bin.libmtmd.so, build.bin.llama-gguf-split, build.bin.llama-llava-cli, build.bin.llama-minicpmv-cli, build.bin.llama-quantize, build.bin.llama-gemma3-cli, build.bin.llama-tokenize, build.bin.llama-qwen2vl-cli, build.bin.llama-bench, build.bin.libggml-base.so, build.bin.libggml-cpu.so, build.bin.libggml.so. 🔎 Full breakdown: Loci Inspector.
Note
Source pull request: ggml-org/llama.cpp#19204
This PR makes the ggml-virtgpu backend thread safe by protecting access to the host<->guest shared memory buffers with a mutex and by caching, during initialization, the constant values queried from the backend.
The unused buffer_type_is_host method is also deprecated.
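As a rough sketch of the approach described above, assuming hypothetical names (virtgpu_context, gpu_shm_buffer, write_request) that do not match the real backend's types: shared-memory accesses are serialized by a mutex, while backend constants are queried once at init and then read without locking.

```cpp
#include <cstddef>
#include <cstring>
#include <mutex>

struct gpu_shm_buffer { void * data = nullptr; size_t size = 0; };

struct virtgpu_context {
    std::mutex     shm_mutex;   // serializes host<->guest shared memory access
    gpu_shm_buffer shm;         // buffer shared with the host

    // Constants queried from the backend once during init; afterwards they
    // are read without taking a lock.
    struct { size_t max_alloc = 0; size_t alignment = 0; } constants;

    void init() {
        // Placeholder values standing in for the one-time host queries.
        constants.max_alloc = 1ull << 30;
        constants.alignment = 64;
    }

    // Every access to the shared buffer happens under the mutex, so
    // concurrent callers cannot interleave writes to the same region.
    void write_request(const void * payload, size_t n) {
        std::lock_guard<std::mutex> lock(shm_mutex);
        if (shm.data != nullptr && n <= shm.size) {
            std::memcpy(shm.data, payload, n);
        }
        // ... submit the command to the host and wait for the reply
        //     while still holding the lock ...
    }
};
```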