Skip to content

CUDA: disable cuda graph when using n-cpu-moe#18593

Merged
am17an merged 2 commits intoggml-org:masterfrom
am17an:cuda-graph-disable-cpu-moe
Jan 4, 2026
Merged

CUDA: disable cuda graph when using n-cpu-moe#18593
am17an merged 2 commits intoggml-org:masterfrom
am17an:cuda-graph-disable-cpu-moe

Conversation

@am17an
Copy link
Contributor

@am17an am17an commented Jan 4, 2026

Missed disabling cuda graphs when -n-cpu-moe is used

@github-actions github-actions bot added Nvidia GPU Issues specific to Nvidia GPUs ggml changes relating to the ggml tensor library for machine learning labels Jan 4, 2026
@am17an am17an merged commit 908a9e5 into ggml-org:master Jan 4, 2026
71 checks passed
@am17an am17an deleted the cuda-graph-disable-cpu-moe branch January 4, 2026 17:37
blime4 referenced this pull request in blime4/llama.cpp Feb 5, 2026
* CUDA: disable cuda graph when using n-cpu-moe

* call ggml_cuda_set_device
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants