
generator: llama/gguf #568

Closed
3 tasks done
leondz opened this issue Mar 22, 2024 · 1 comment · Fixed by #581
Labels: generators (Interfaces with LLMs)
Milestone: release 0.9.1

Comments
leondz (Owner) commented Mar 22, 2024

ggml is an initiative to build small models through quantisation of parameters to smaller datatypes (ggml.ai)

Garak supports an old version of their interface: ggml model files run via llama.cpp.

Things have moved forward with ggml: the llama/ggml connector in garak now appears problematic or broken, and a new file format, gguf, has superseded the ggml format.

We should

  1. support gguf (see the sketch after this list)
  2. fix or remove the existing ggml file interface
  3. consider having some kind of test for this, depending on the degree of orchestration required
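For item 1, a gguf-backed generator could wrap llama-cpp-python rather than shelling out to a llama.cpp binary. The sketch below is illustrative only and is not garak's actual generator interface: the class name, constructor parameters, and model path are assumptions.

```python
# Minimal sketch of a gguf generator built on llama-cpp-python.
# Assumes `pip install llama-cpp-python`; class and parameter names are
# illustrative, not garak's real Generator API.
from llama_cpp import Llama


class GgufGenerator:
    """Generate completions from a local .gguf model via llama-cpp-python."""

    def __init__(self, model_path: str, max_tokens: int = 128):
        self.max_tokens = max_tokens
        # Llama() loads the gguf file directly; n_ctx sets the context window.
        self.llm = Llama(model_path=model_path, n_ctx=2048, verbose=False)

    def generate(self, prompt: str) -> str:
        # Calling the model returns an OpenAI-style dict with a "choices" list.
        result = self.llm(prompt, max_tokens=self.max_tokens)
        return result["choices"][0]["text"]


if __name__ == "__main__":
    gen = GgufGenerator("models/llama-2-7b.Q4_K_M.gguf")  # hypothetical path
    print(gen.generate("The quick brown fox"))
```

For item 3, a smoke test could load a small .gguf file and assert a non-empty completion, which keeps the required orchestration minimal.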
leondz added the generators (Interfaces with LLMs) label Mar 22, 2024
leondz (Owner, Author) commented Mar 22, 2024

see #540, #474

leondz added this to the release 0.9.1 milestone Mar 24, 2024
This was referenced Mar 24, 2024
leondz linked a pull request (#581) Apr 4, 2024 that will close this issue