Skip to content

Pull requests: ggml-org/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Add LLaDA-7b-MoE diffusion model
#16003 opened Sep 15, 2025 by am17an Loading…
--numa mirror: mirror model weights to every Numa node in the system Apple Metal https://en.wikipedia.org/wiki/Metal_(API) Ascend NPU issues specific to Ascend NPUs devops improvements to build systems and github actions examples ggml changes relating to the ggml tensor library for machine learning IBM zDNN issues specific to IBM zDNN Accelerator Nvidia GPU Issues specific to Nvidia GPUs OpenCL Issues specific to the OpenCL backend python python script changes SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language testing Everything test related Vulkan Issues specific to the Vulkan backend
#16000 opened Sep 15, 2025 by dbsanfte Draft
docker : enable rocWMMA in ROCm images, add gfx1151 devops improvements to build systems and github actions
#15997 opened Sep 14, 2025 by slaren Loading…
metal : refactor + optimize v2 Apple Metal https://en.wikipedia.org/wiki/Metal_(API) devops improvements to build systems and github actions ggml changes relating to the ggml tensor library for machine learning
#15995 opened Sep 14, 2025 by ggerganov Draft
2 of 5 tasks
vulkan : shader development improvements ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#15993 opened Sep 14, 2025 by Acly Loading…
releases : switch to rocWMMA develop branch, add gfx1151 devops improvements to build systems and github actions
#15992 opened Sep 14, 2025 by slaren Loading…
SYCL: Add COUNT_EQUAL operator support documentation Improvements or additions to documentation ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language testing Everything test related
#15991 opened Sep 14, 2025 by yael-works Loading…
ggml: add FLOOR unary op (CPU + SYCL) ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language testing Everything test related
#15989 opened Sep 14, 2025 by safranowith Loading…
llama-run: Fix model download on Windows examples
#15988 opened Sep 14, 2025 by npopov-vst Loading…
metal : use virtual GPU address for private buffers Apple Metal https://en.wikipedia.org/wiki/Metal_(API) ggml changes relating to the ggml tensor library for machine learning
#15985 opened Sep 14, 2025 by ggerganov Draft
CUDA: fix FA occupancy, optimize tile kernel ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#15982 opened Sep 14, 2025 by JohannesGaessler Loading…
SYCL: Add ARANGE operator with GPU kernel, tests, and documentation documentation Improvements or additions to documentation ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language testing Everything test related
#15978 opened Sep 14, 2025 by GittyBurstein Loading…
vulkan: automatically remove unsupported devices ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#15976 opened Sep 14, 2025 by netrunnereve Loading…
cli: allow layer groups in --n-cpu-moe
#15975 opened Sep 14, 2025 by lksj92hs Loading…
CUDA: Optimize PAD_REFLECT_1D ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs testing Everything test related
#15957 opened Sep 13, 2025 by bugparty Loading…
CUDA: fix im2col_3d to respect non-contiguous inputs (views) ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#15956 opened Sep 13, 2025 by jakekarnes42 Loading…
ggml-cpu: optimize the ggml NORM operation ggml changes relating to the ggml tensor library for machine learning testing Everything test related
#15953 opened Sep 12, 2025 by duduta Loading…
opencl: fix concat crash on win arm64 with Adreno ggml changes relating to the ggml tensor library for machine learning OpenCL Issues specific to the OpenCL backend
#15944 opened Sep 12, 2025 by lhez Loading…
ci : update macos-latest* jobs to use macos-latest devops improvements to build systems and github actions
#15938 opened Sep 11, 2025 by danbev Loading…
CANN: Fix ggml_cann_set_device to avoid redundant device switches Ascend NPU issues specific to Ascend NPUs ggml changes relating to the ggml tensor library for machine learning
#15935 opened Sep 11, 2025 by noemotiovon Loading…
ggml : fix padding in timestep embedding kernels Apple Metal https://en.wikipedia.org/wiki/Metal_(API) ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs OpenCL Issues specific to the OpenCL backend SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language Vulkan Issues specific to the Vulkan backend
#15932 opened Sep 11, 2025 by danbev Loading…
ProTip! Mix and match filters to narrow down what you’re looking for.