-
Notifications
You must be signed in to change notification settings - Fork 11.3k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
opencl: better identify Adreno GPU
ggml
changes relating to the ggml tensor library for machine learning
#12760
opened Apr 4, 2025 by
lhez
Loading…
Added all CPU to Docker GPU images for 'token_embd.weight' compatibility
devops
improvements to build systems and github actions
#12749
opened Apr 4, 2025 by
rudiservo
Loading…
sycl:remove redundant memcopy in function ggml_backend_sycl_buffer_set_tensor
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#12734
opened Apr 3, 2025 by
zhouwg
Loading…
sync : ggml
ggml
changes relating to the ggml tensor library for machine learning
script
Script related
#12732
opened Apr 3, 2025 by
ggerganov
Loading…
CANN: Refactor to reduce duplicate code
ggml
changes relating to the ggml tensor library for machine learning
Update llama-quant.cpp llama_tensor_get_type with DeepSeek friendly modifications
ggml
changes relating to the ggml tensor library for machine learning
#12727
opened Apr 3, 2025 by
bartowski1182
Loading…
vulkan: Use unclamped loads for flash attention mask
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#12720
opened Apr 2, 2025 by
jeffbolznv
Loading…
imatrix: add option to display importance score statistics for a given imatrix file
examples
#12718
opened Apr 2, 2025 by
EAddario
Loading…
Fix: Abnormal exit on Android devices
ggml
changes relating to the ggml tensor library for machine learning
#12712
opened Apr 2, 2025 by
biyou
Loading…
[RFC][WIP] Common: Add an Initial Chat Memory Interface/Implementation
examples
server
#12698
opened Apr 1, 2025 by
markhpc
Loading…
WIP: Add support for CogAgent
examples
python
python script changes
server
#12679
opened Mar 31, 2025 by
Tianyue-Zhao
•
Draft
update changes relating to the ggml tensor library for machine learning
rope_multi
:
ggml
#12665
opened Mar 31, 2025 by
foldl
Loading…
tts : implement sesame CSM + Mimi decoder
examples
python
python script changes
#12648
opened Mar 29, 2025 by
ngxson
Loading…
llama-server : implement universal assisted decoding
examples
server
#12635
opened Mar 28, 2025 by
g2mt
Loading…
opencl: remove a self-referential macro
ggml
changes relating to the ggml tensor library for machine learning
#12626
opened Mar 28, 2025 by
linehill
Loading…
opencl: Add support for multiple devices
ggml
changes relating to the ggml tensor library for machine learning
Enable MMA for BF16 data types on Powerpc
ggml
changes relating to the ggml tensor library for machine learning
#12565
opened Mar 25, 2025 by
shalinib-ibm
•
Draft
ggml-quants : weighted rounding algorithms with cumulative search
generation quality
Quality of model output
ggml
changes relating to the ggml tensor library for machine learning
Less than 4 bits
Efforts related to viable quantized models using <4 bits
research 🔬
Review Complexity : Medium
Generally require more time to grok but manageable by beginner to medium expertise level
Tensor Encoding Scheme
https://github.com/ggerganov/llama.cpp/wiki/Tensor-Encoding-Schemes
Draft: vulkan: Add bfloat16 support
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#12554
opened Mar 24, 2025 by
jeffbolznv
Loading…
Previous Next
ProTip!
Follow long discussions with comments:>50.