Add examples #4

AlexKoff88 · 2023-12-13T07:01:41Z

The repo's API allows benchmarking the accuracy of several optimization backends, including:

HuggingFace (4-bit via bitsandbytes)
GPTQ
llama.cpp (via bigdl-llm)
OpenVINO (via optimum)

I suggest creating examples for all four backends that demonstrate their capabilities. We can also run a comparison and publish the results in the README.
You can use optimized versions of llama-7B from here: https://huggingface.co/TheBloke
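
As a starting point for the comparison, here is a rough sketch of how per-backend scores could be collected and rendered as a README table. It does not use the repo's actual API: `run_comparison`, `to_markdown_table`, and the dummy callables are hypothetical names made up for illustration, and the zero values are placeholders, not measured results. Each callable would need to be replaced with a real evaluation for the corresponding backend.

```python
from typing import Callable, Dict


def run_comparison(backends: Dict[str, Callable[[], float]]) -> Dict[str, float]:
    """Call each backend's evaluation function and collect the scores.

    Hypothetical helper: every value in `backends` is assumed to be a
    zero-argument callable returning a single accuracy-style number.
    """
    return {name: evaluate() for name, evaluate in backends.items()}


def to_markdown_table(scores: Dict[str, float], metric: str = "score") -> str:
    """Render the collected scores as a markdown table for the README."""
    lines = [f"| Backend | {metric} |", "| --- | --- |"]
    for name, value in scores.items():
        lines.append(f"| {name} | {value:.4f} |")
    return "\n".join(lines)


if __name__ == "__main__":
    # Dummy callables stand in for real evaluations (assumption);
    # the 0.0 values are placeholders, not benchmark results.
    dummy_backends = {
        "HuggingFace (4-bit via bitsandbytes)": lambda: 0.0,
        "GPTQ": lambda: 0.0,
        "llama.cpp (via bigdl-llm)": lambda: 0.0,
        "OpenVINO (via optimum)": lambda: 0.0,
    }
    print(to_markdown_table(run_comparison(dummy_backends)))
```

Keeping the evaluation behind a plain callable means each backend example can live in its own script and still feed the same table.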

AlexKoff88 commented Dec 13, 2023 • edited by andreyanufr
