UPSTREAM PR #18548: graph : reduce topology branching (#790)
Open
Explore the complete analysis inside Version Insights. Key findings from the summary report:

Summary Report for llama.cpp PR #790
Project Information:

Performance Analysis Results: ✅ No significant performance regressions detected. The analysis shows that Pull Request #790 maintains performance parity with the base version.

Conclusion:
Force-pushed from c5b5703 to dc4341b (compare)
Force-pushed from 3210e33 to 9ffcb53 (compare)
Mirrored from ggml-org/llama.cpp#18548
ref #18547
Reduce the amount of graph topology changes when switching between token and embedding inputs:
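By way of background, a minimal illustrative sketch (not the PR's actual code): in ggml-based graph building, token-ID inputs enter the graph through an embedding-row lookup while raw embedding inputs are fed in as a tensor directly, so the two cases produce structurally different graphs. The helper name `build_input` and its signature below are hypothetical; only the ggml calls are real API.

```cpp
// Illustrative sketch only -- shows the kind of topology branch that arises
// when a graph is built differently for token inputs vs. embedding inputs.
// `build_input` is a hypothetical helper, not llama.cpp's actual code.
#include "ggml.h"

// Returns the first hidden-state tensor of the graph.
// tok_embd : token-embedding matrix [n_embd, n_vocab]
// If use_tokens is true, the graph starts with a ggml_get_rows lookup on
// integer token IDs; otherwise it starts with a raw [n_embd, n_tokens]
// float input -- two different graph topologies for the same downstream
// computation.
static struct ggml_tensor * build_input(
        struct ggml_context * ctx,
        struct ggml_tensor  * tok_embd,
        int64_t               n_embd,
        int64_t               n_tokens,
        bool                  use_tokens) {
    if (use_tokens) {
        // token-ID path: integer IDs -> embedding lookup
        struct ggml_tensor * inp_tokens = ggml_new_tensor_1d(ctx, GGML_TYPE_I32, n_tokens);
        return ggml_get_rows(ctx, tok_embd, inp_tokens);
    }
    // embedding path: the caller supplies the embeddings directly
    return ggml_new_tensor_2d(ctx, GGML_TYPE_F32, n_embd, n_tokens);
}
```

Switching between these two entry points changes the graph's structure, which is the branching this PR aims to reduce; the specific changes follow below.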