@saood06 saood06 commented Feb 23, 2025

Port of ggml-org/llama.cpp@9488fbf

This is a good benchmarking tool, as requested in #223.

As a very quick demo, I generated the plots below by running

    ./llama-sweep-bench -c 2048 -ub 512 -m WizardLM-2-8x22B-IQ4_K_R4.gguf -ctk q8_KV -ctv q8_0 -fa --output-format jsonl

and then passing the output to sweep-bench-plot.py.

[Plot: performance_comparison_pp]

[Plot: performance_comparison_tg]
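
For anyone who wants to look at the numbers without the plotting script, here is a minimal sketch of reading the JSONL output in Python. It is not the code in sweep-bench-plot.py. The field names n_kv, speed_pp and speed_tg are assumptions about what the jsonl records contain, and the sample records are made up for illustration, not measured results.

```python
import json

def load_sweep(lines):
    """Parse JSONL records into (n_kv, speed_pp, speed_tg) tuples, sorted by n_kv.

    The keys "n_kv", "speed_pp" and "speed_tg" are assumed field names;
    adjust them to whatever your llama-sweep-bench output actually contains.
    """
    rows = []
    for line in lines:
        line = line.strip()
        if not line.startswith("{"):
            continue  # skip blank lines and any non-JSON log output
        rec = json.loads(line)
        rows.append((rec["n_kv"], rec["speed_pp"], rec["speed_tg"]))
    return sorted(rows)

# Made-up sample records, for illustration only (not benchmark results).
sample = [
    '{"n_kv": 512, "speed_pp": 90.0, "speed_tg": 9.5}',
    '',
    '{"n_kv": 0, "speed_pp": 100.0, "speed_tg": 10.0}',
]

for n_kv, pp, tg in load_sweep(sample):
    print(f"n_kv={n_kv:5d}  pp={pp:8.2f} t/s  tg={tg:8.2f} t/s")
```

With real output you would pass an open file handle instead of the sample list, since iterating over a file yields its lines.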

  • Self-reported review complexity:
    • Low
    • Medium
    • High

@ikawrakow ikawrakow left a comment

Thank you for this - can be very useful.

@saood06 saood06 merged commit 46bf73a into main Feb 23, 2025
Nexesenex added a commit to Nexesenex/ik_llama.cpp.nxs that referenced this pull request Mar 3, 2025
ubergarm added a commit to ubergarm/llama.cpp that referenced this pull request Apr 26, 2025
@ubergarm

@saood06 thanks I'm a convert to llama-sweep-bench! It is indeed very useful.

I pushed a branch on my personal mainline llama.cpp fork just to use for testing performance across forks. I don't plan to open a PR to mainline, but just left it up there in case anyone else is using it. I'm guessing ik has something similar as we were comparing the new GLM-4 performance.

Thanks!

ubergarm added further commits to ubergarm/llama.cpp that referenced this pull request on May 3, 4, 7, 13, 18, 30; Jun 22, 27, 28; Jul 2, 10, 11, 12; Aug 5, 10, 14, 17, 24, 25, 28, 29; Sep 2, 5, 11, 22, 25, 26, 28; and Oct 1, 13, 15, 16, 2025.