metal : support virtual devices by ggerganov · Pull Request #18919 · ggml-org/llama.cpp

ggerganov · 2026-01-18T16:40:31Z

Add support for virtual Metal devices. This allows simulating a multi-GPU environment on a Mac by setting the new GGML_METAL_DEVICES environment variable, for example with 4 devices:

GGML_METAL_DEVICES=4 ./bin/llama-completion -m [model.gguf]

...

0.02.020.033 I llama_memory_breakdown_print: | memory breakdown [MiB]    |  total     free    self   model   context   compute    unaccounted |
0.02.020.034 I llama_memory_breakdown_print: |   - MTL0 (Apple M2 Ultra) | 165150 = 158091 + (1916 =   780 +    1024 +     112) +        5143 |
0.02.020.034 I llama_memory_breakdown_print: |   - MTL1 (Apple M2 Ultra) | 165150 = 158091 + (1738 =   780 +     896 +      62) +        5320 |
0.02.020.036 I llama_memory_breakdown_print: |   - MTL2 (Apple M2 Ultra) | 165150 = 158091 + (1198 =   240 +     896 +      62) +        5861 |
0.02.020.037 I llama_memory_breakdown_print: |   - MTL3 (Apple M2 Ultra) | 165150 = 158091 + (2205 =  1137 +     768 +     300) +        4853 |
0.02.020.037 I llama_memory_breakdown_print: |   - Host                  |                     364 =   296 +       0 +      68                |
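The columns in the breakdown above read as total = free + self + unaccounted, with self = model + context + compute. A minimal sketch for checking that arithmetic on the four MTL rows follows. The helper names (`parse_rows`, `check`) are hypothetical and not part of llama.cpp, and the 1 MiB tolerance is an assumption to absorb rounding in the printed values. The Host row uses a different layout and is skipped.

```python
import re

# Device rows copied from the memory breakdown above (values in MiB).
LOG = """\
|   - MTL0 (Apple M2 Ultra) | 165150 = 158091 + (1916 =   780 +    1024 +     112) +        5143 |
|   - MTL1 (Apple M2 Ultra) | 165150 = 158091 + (1738 =   780 +     896 +      62) +        5320 |
|   - MTL2 (Apple M2 Ultra) | 165150 = 158091 + (1198 =   240 +     896 +      62) +        5861 |
|   - MTL3 (Apple M2 Ultra) | 165150 = 158091 + (2205 =  1137 +     768 +     300) +        4853 |
"""

# name | total = free + (self = model + context + compute) + unaccounted
ROW = re.compile(
    r"-\s+(MTL\d+)\s+\(.*?\)\s*\|\s*(\d+)\s*=\s*(\d+)\s*\+\s*"
    r"\(\s*(\d+)\s*=\s*(\d+)\s*\+\s*(\d+)\s*\+\s*(\d+)\s*\)\s*\+\s*(\d+)"
)

FIELDS = ("total", "free", "self", "model", "context", "compute", "unaccounted")


def parse_rows(text):
    """Return {device_name: {field: MiB}} for every MTL row found in text."""
    rows = {}
    for m in ROW.finditer(text):
        values = [int(v) for v in m.groups()[1:]]
        rows[m.group(1)] = dict(zip(FIELDS, values))
    return rows


def check(row, tol=1):
    """Check both sums; tol (MiB) is an assumed allowance for rounding."""
    self_ok = row["self"] == row["model"] + row["context"] + row["compute"]
    parts = row["free"] + row["self"] + row["unaccounted"]
    total_ok = abs(row["total"] - parts) <= tol
    return self_ok and total_ok


if __name__ == "__main__":
    for name, row in parse_rows(LOG).items():
        print(name, "consistent" if check(row) else "inconsistent")
```

All four virtual devices report the same total and free values, which would be expected if they are backed by the same physical GPU.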

* metal : support virtual devices
* cont : manage buffer type context memory
* metal : add events
* cont : implement cpy_tensor_async

github-actions bot added the ggml and Apple Metal labels Jan 18, 2026

loci-dev mentioned this pull request Jan 18, 2026

UPSTREAM PR #18919: metal : support virtual devices auroralabs-loci/llama.cpp#960

Open

ggerganov force-pushed the gg/metal-virtual-devices branch from e644345 to 06ac1f7 Compare January 20, 2026 15:40

This was referenced Jan 20, 2026

metal : add events #18966

Merged

CUDA: Improve performance via less synchronizations between token #17795

Merged

ggerganov added 2 commits January 31, 2026 10:05

metal : support virtual devices

7758a58

cont : manage buffer type context memory

4d19e61

ggerganov force-pushed the gg/metal-virtual-devices branch from 06ac1f7 to 4d19e61 Compare January 31, 2026 08:05

loci-dev mentioned this pull request Jan 31, 2026

UPSTREAM PR #18919: metal : support virtual devices auroralabs-loci/llama.cpp#1103

Open

ggerganov added 2 commits January 31, 2026 18:00

metal : add events

5c35f46

cont : implement cpy_tensor_async

2976dd8

ggerganov merged commit 6fdddb4 into master Feb 2, 2026
73 of 78 checks passed

ggerganov deleted the gg/metal-virtual-devices branch February 2, 2026 12:29

ggerganov mentioned this pull request Feb 3, 2026

ci : add metal server workflows #19293

Merged

1 task

shaofeiqi pushed a commit to qualcomm/llama.cpp that referenced this pull request Feb 6, 2026

metal : support virtual devices (ggml-org#18919)

f6cc6fb

* metal : support virtual devices
* cont : manage buffer type context memory
* metal : add events
* cont : implement cpy_tensor_async

liparetejas pushed a commit to liparetejas/llama.cpp that referenced this pull request Feb 23, 2026

metal : support virtual devices (ggml-org#18919)

74b3495

* metal : support virtual devices
* cont : manage buffer type context memory
* metal : add events
* cont : implement cpy_tensor_async

ggerganov merged 4 commits into master from gg/metal-virtual-devices
