-
Notifications
You must be signed in to change notification settings - Fork 13.9k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
vulkan: add more num_blocks instantiations in rms_norm
#17701
opened Dec 2, 2025 by
jeffbolznv
Loading…
llama-server: fix duplicate HTTP headers in multiple models mode
examples
server
#17698
opened Dec 2, 2025 by
ServeurpersoCom
Loading…
model : add ASR support for LFM2-Audio-1.5B
examples
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
python
python script changes
testing
Everything test related
ggml-zendnn : add ZenDNN backend for AMD CPUs
documentation
Improvements or additions to documentation
examples
ggml
changes relating to the ggml tensor library for machine learning
#17690
opened Dec 2, 2025 by
z-vishal
Loading…
Use OpenAI-compatible
/v1/models endpoint by default
examples
server
#17689
opened Dec 2, 2025 by
allozaur
Loading…
Document how to compile with Vulkan using Debian/Ubuntu packages
documentation
Improvements or additions to documentation
#17688
opened Dec 2, 2025 by
socram8888
Loading…
vulkan: enable mmvq for q2_k on NVIDIA
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#17675
opened Dec 2, 2025 by
jeffbolznv
Loading…
vulkan: perf_logger improvements
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#17672
opened Dec 2, 2025 by
jeffbolznv
Loading…
vulkan: fix top_k bug when there are ties in the input
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#17659
opened Dec 1, 2025 by
jeffbolznv
Loading…
ggml: use 'exists( const std::filesystem::path&, std::error_code&)' instead of 'exists( const std::filesystem::path&)' to enhance robustness
ggml
changes relating to the ggml tensor library for machine learning
#17653
opened Dec 1, 2025 by
flyinskyin2013
Loading…
ggml: added missing cast sections in memcpy
ggml
changes relating to the ggml tensor library for machine learning
vibe-coded
Created with heavy use of LLM assistants, requires human verification
#17651
opened Dec 1, 2025 by
GermanAizek
Loading…
ggml-cpu: remove duplicate conditional check 'iid'
ggml
changes relating to the ggml tensor library for machine learning
#17650
opened Dec 1, 2025 by
GermanAizek
Loading…
gguf: llama: use changes relating to the ggml tensor library for machine learning
vibe-coded
Created with heavy use of LLM assistants, requires human verification
= default for trivial constructors and destructors
ggml
#17649
opened Dec 1, 2025 by
GermanAizek
Loading…
sgemm: reuse loaded vector in AVX dot product calculation
ggml
changes relating to the ggml tensor library for machine learning
vibe-coded
Created with heavy use of LLM assistants, requires human verification
#17648
opened Dec 1, 2025 by
GermanAizek
Loading…
llama-vocab: replace postfix with prefix increment for iterators
vibe-coded
Created with heavy use of LLM assistants, requires human verification
#17646
opened Dec 1, 2025 by
GermanAizek
Loading…
vec: optimize AVX2/FMA sum-of-squares with loop unrolling and FMA
ggml
changes relating to the ggml tensor library for machine learning
vibe-coded
Created with heavy use of LLM assistants, requires human verification
#17642
opened Dec 1, 2025 by
GermanAizek
Loading…
ggml-quants: use _mm256_testz_si256 for mask checks in AVX2
ggml
changes relating to the ggml tensor library for machine learning
vibe-coded
Created with heavy use of LLM assistants, requires human verification
#17641
opened Dec 1, 2025 by
GermanAizek
Loading…
ggml-alloc: optimize free block shifting with changes relating to the ggml tensor library for machine learning
vibe-coded
Created with heavy use of LLM assistants, requires human verification
memmove
ggml
#17640
opened Dec 1, 2025 by
GermanAizek
Loading…
vulkan: Replace deprecated VK_EXT_validation_features
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#17637
opened Dec 1, 2025 by
rillomas
Loading…
common : compute average token length from vocabulary
#17632
opened Dec 1, 2025 by
yifant-code
•
Draft
llama-router, the C++ "llama-swap" for llama.cpp
examples
need feedback
Testing and feedback with results are needed
server
testing
Everything test related
#17629
opened Nov 30, 2025 by
ServeurpersoCom
•
Draft
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.