Pull requests: ggml-org/llama.cpp
Add AfmoeForCausalLM support
python
python script changes
#16477
opened Oct 8, 2025 by
bartowski1182
Draft
[SYCL] refactor soft_max, add soft_max_back
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#16472
opened Oct 8, 2025 by
NeoZhangJianyu
CUDA Copy Kernel for Contiguous Tensors for GGML CPY OP
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#16471
opened Oct 8, 2025 by
anavp-nvidia
fix: convert_hf_to_gguf - change Jamba non-sentencepiece mode (tokeni…
python
python script changes
#16470
opened Oct 8, 2025 by
amirai21
vulkan: Add State Space Model (SSM) Operations Support
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#16463
opened Oct 7, 2025 by
giuseppe
ci: add ARM64 Kleidiai build and test support
devops
improvements to build systems and github actions
#16462
opened Oct 7, 2025 by
sudhiarm
kleidiai: kernel interface refactoring
ggml
changes relating to the ggml tensor library for machine learning
#16460
opened Oct 7, 2025 by
chaxu01
Add hipblasLt implementation for batched gemm to improve performance for CDNA3 only
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#16457
opened Oct 7, 2025 by
peizhang56
vulkan: Handle FA with all -inf mask values
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#16447
opened Oct 6, 2025 by
jeffbolznv
Metal Pool 1D Kernel
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
#16429
opened Oct 5, 2025 by
ThoreKoritzius
fix: add generic fallback to detect trailing <think> tags in Jinja templates and handle forced-open reasoning blocks
testing
Everything test related
#16426
opened Oct 4, 2025 by
ServeurpersoCom
Draft
server / ranking : add sorting and management of top_n
examples
server
#16403
opened Oct 3, 2025 by
YannFollet
model: EmbeddingGemma Adding Support for SentenceTransformers Dense Modules
python
python script changes
#16367
opened Oct 1, 2025 by
sfallah
Add ARANGE Operator to SYCL Backend (Small & Focused Changes)
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#16362
opened Sep 30, 2025 by
GittyBurstein
feat: render user content as markdown option
examples
server
#16358
opened Sep 30, 2025 by
ServeurpersoCom
SYCL SET operator optimized for F32 tensors
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#16350
opened Sep 30, 2025 by
GittyBurstein
Update build.md
documentation
Improvements or additions to documentation
#16346
opened Sep 30, 2025 by
refine360-debug
ggml-cpu : inspect -march and -mcpu to found the CPU
ggml
changes relating to the ggml tensor library for machine learning
#16333
opened Sep 29, 2025 by
angt