Skip to content

Pull requests: ggml-org/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

cmake : enable curl by default build Compilation issues devops improvements to build systems and github actions examples server
#12761 opened Apr 4, 2025 by ngxson Loading…
opencl: better identify Adreno GPU ggml changes relating to the ggml tensor library for machine learning
#12760 opened Apr 4, 2025 by lhez Loading…
clip : refactor clip_init, add tests examples
#12757 opened Apr 4, 2025 by ngxson Loading…
Added all CPU to Docker GPU images for 'token_embd.weight' compatibility devops improvements to build systems and github actions
#12749 opened Apr 4, 2025 by rudiservo Loading…
(wip) support ultravox audio input examples python python script changes
#12745 opened Apr 3, 2025 by ngxson Draft
sycl:remove redundant memcopy in function ggml_backend_sycl_buffer_set_tensor ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#12734 opened Apr 3, 2025 by zhouwg Loading…
sync : ggml ggml changes relating to the ggml tensor library for machine learning script Script related
#12732 opened Apr 3, 2025 by ggerganov Loading…
CANN: Refactor to reduce duplicate code ggml changes relating to the ggml tensor library for machine learning
#12731 opened Apr 3, 2025 by hipudding Draft
Update llama-quant.cpp llama_tensor_get_type with DeepSeek friendly modifications ggml changes relating to the ggml tensor library for machine learning
#12727 opened Apr 3, 2025 by bartowski1182 Loading…
DeepSeek V2/V3 with -mla option examples python python script changes server
#12725 opened Apr 2, 2025 by jukofyork Draft
vulkan: Use unclamped loads for flash attention mask ggml changes relating to the ggml tensor library for machine learning testing Everything test related Vulkan Issues specific to the Vulkan backend
#12720 opened Apr 2, 2025 by jeffbolznv Loading…
Fix: Abnormal exit on Android devices ggml changes relating to the ggml tensor library for machine learning
#12712 opened Apr 2, 2025 by biyou Loading…
WIP: Add support for CogAgent examples python python script changes server
#12679 opened Mar 31, 2025 by Tianyue-Zhao Draft
update rope_multi: ggml changes relating to the ggml tensor library for machine learning
#12665 opened Mar 31, 2025 by foldl Loading…
llama : nit, DeepSeek V1 MoE is 16B and GigaChat is 20B
#12652 opened Mar 30, 2025 by CISC Loading…
tts : implement sesame CSM + Mimi decoder examples python python script changes
#12648 opened Mar 29, 2025 by ngxson Loading…
opencl: remove a self-referential macro ggml changes relating to the ggml tensor library for machine learning
#12626 opened Mar 28, 2025 by linehill Loading…
opencl: Add support for multiple devices ggml changes relating to the ggml tensor library for machine learning
#12622 opened Mar 28, 2025 by linehill Draft
Enable MMA for BF16 data types on Powerpc ggml changes relating to the ggml tensor library for machine learning
#12565 opened Mar 25, 2025 by shalinib-ibm Draft
ggml-quants : weighted rounding algorithms with cumulative search generation quality Quality of model output ggml changes relating to the ggml tensor library for machine learning Less than 4 bits Efforts related to viable quantized models using <4 bits research 🔬 Review Complexity : Medium Generally require more time to grok but manageable by beginner to medium expertise level Tensor Encoding Scheme https://github.com/ggerganov/llama.cpp/wiki/Tensor-Encoding-Schemes
#12557 opened Mar 25, 2025 by compilade Draft
Draft: vulkan: Add bfloat16 support ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#12554 opened Mar 24, 2025 by jeffbolznv Loading…
ProTip! Follow long discussions with comments:>50.