Pull requests: ggml-org/llama.cpp
examples : support encoder-decoder models in the simple example
examples
#16002
opened Sep 15, 2025 by
DamonFool
--numa mirror: mirror model weights to every Numa node in the system
Apple Metal
docker : enable rocWMMA in ROCm images, add gfx1151
devops
improvements to build systems and github actions
#15997
opened Sep 14, 2025 by
slaren
metal : refactor + optimize v2
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
devops
improvements to build systems and github actions
ggml
changes relating to the ggml tensor library for machine learning
vulkan : shader development improvements
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#15993
opened Sep 14, 2025 by
Acly
releases : switch to rocWMMA develop branch, add gfx1151
devops
improvements to build systems and github actions
#15992
opened Sep 14, 2025 by
slaren
SYCL: Add COUNT_EQUAL operator support
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
testing
Everything test related
#15991
opened Sep 14, 2025 by
yael-works
ggml: add FLOOR unary op (CPU + SYCL)
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
testing
Everything test related
#15989
opened Sep 14, 2025 by
safranowith
metal : use virtual GPU address for private buffers
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
CUDA: fix FA occupancy, optimize tile kernel
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#15982
opened Sep 14, 2025 by
JohannesGaessler
SYCL: Add ARANGE operator with GPU kernel, tests, and documentation
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
testing
Everything test related
#15978
opened Sep 14, 2025 by
GittyBurstein
vulkan: automatically remove unsupported devices
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#15976
opened Sep 14, 2025 by
netrunnereve
Add resumable downloads for llama-server model loading
#15963
opened Sep 13, 2025 by
ericcurtin
CUDA: Optimize PAD_REFLECT_1D
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
testing
Everything test related
#15957
opened Sep 13, 2025 by
bugparty
CUDA: fix im2col_3d to respect non-contiguous inputs (views)
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#15956
opened Sep 13, 2025 by
jakekarnes42
ggml-cpu: optimize the ggml NORM operation
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
#15953
opened Sep 12, 2025 by
duduta
opencl: fix concat crash on win arm64 with Adreno
ggml
changes relating to the ggml tensor library for machine learning
OpenCL
Issues specific to the OpenCL backend
#15944
opened Sep 12, 2025 by
lhez
ci : update macos-latest* jobs to use macos-latest
devops
improvements to build systems and github actions
#15938
opened Sep 11, 2025 by
danbev
CANN: Fix ggml_cann_set_device to avoid redundant device switches
Ascend NPU
issues specific to Ascend NPUs
ggml
changes relating to the ggml tensor library for machine learning
#15935
opened Sep 11, 2025 by
noemotiovon
ggml : fix padding in timestep embedding kernels
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
OpenCL
Issues specific to the OpenCL backend
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
Vulkan
Issues specific to the Vulkan backend
#15932
opened Sep 11, 2025 by
danbev