Pull requests: ggml-org/llama.cpp

Pull requests list

Add AfmoeForCausalLM support (labels: python)
#16477 opened Oct 8, 2025 by bartowski1182 Draft
[SYCL] refactor soft_max, add soft_max_back (labels: ggml, SYCL)
#16472 opened Oct 8, 2025 by NeoZhangJianyu
CUDA Copy Kernel for Contiguous Tensors for GGML CPY OP (labels: ggml, Nvidia GPU)
#16471 opened Oct 8, 2025 by anavp-nvidia
fix: convert_hf_to_gguf - change Jamba non-sentencepiece mode (tokeni… (labels: python)
#16470 opened Oct 8, 2025 by amirai21
opencl: add q8_0 mm support (labels: ggml, OpenCL)
#16469 opened Oct 8, 2025 by lhez Draft
vulkan: Add State Space Model (SSM) Operations Support (labels: ggml, Vulkan)
#16463 opened Oct 7, 2025 by giuseppe
ci: add ARM64 Kleidiai build and test support (labels: devops)
#16462 opened Oct 7, 2025 by sudhiarm
kleidiai: kernel interface refactoring (labels: ggml)
#16460 opened Oct 7, 2025 by chaxu01
Add hipblasLt implementation for batched gemm to improve performance for CDNA3 only (labels: ggml, Nvidia GPU)
#16457 opened Oct 7, 2025 by peizhang56
vulkan: Handle FA with all -inf mask values (labels: ggml, Vulkan)
#16447 opened Oct 6, 2025 by jeffbolznv
Metal Pool 1D Kernel (labels: Apple Metal, ggml, testing)
#16429 opened Oct 5, 2025 by ThoreKoritzius
Implement llama-pull tool (labels: examples)
#16423 opened Oct 4, 2025 by ericcurtin
contrib : add fish completions via --completion-fish
#16404 opened Oct 3, 2025 by g0t4
server : host-memory prompt caching (labels: examples, python, server)
#16391 opened Oct 2, 2025 by ggerganov
4 of 5 tasks
model: EmbeddingGemma Adding Support for SentenceTransformers Dense Modules (labels: python)
#16367 opened Oct 1, 2025 by sfallah
Add ARANGE Operator to SYCL Backend (Small & Focused Changes) (labels: ggml, SYCL)
#16362 opened Sep 30, 2025 by GittyBurstein
SYCL SET operator optimized for F32 tensors (labels: ggml, SYCL)
#16350 opened Sep 30, 2025 by GittyBurstein
Update build.md (labels: documentation)
#16346 opened Sep 30, 2025 by refine360-debug
ggml-cpu : inspect -march and -mcpu to found the CPU (labels: ggml)
#16333 opened Sep 29, 2025 by angt