UPSTREAM PR #18547: context : reserve new scheduler when graph topology changes (#792)
Conversation
Force-pushed from ca0d661 to 594833d
Force-pushed from 4aa4588 to c92df39
Version Insights summary: The performance analysis of PR #792, comparing two versions of the llama.cpp codebase, suggests the changes may have introduced additional overhead in container operations or iterator usage.
Force-pushed from c76f9f8 to 54fde76
Force-pushed from c92df39 to cf2b3ca
Version Insights summary: The report shows significant performance regressions in PR #792 and strongly suggests investigating the changes before merging, as they introduce substantial regressions in core STL operations that could impact overall application performance.
Force-pushed from 118039a to cd122e2
Force-pushed from 8b542ba to 4f98d83
Version Insights summary: PR #792 introduces significant performance changes. Most functions show substantial throughput increases, ranging from 17% to 289%. The report includes a detailed analysis of the top 10 functions by performance change, key observations, and recommendations for moving forward with this pull request.
Mirrored from ggml-org/llama.cpp#18547
cont #17617
In some cases we know in advance that a graph reallocation will be necessary (see ggml-org/llama.cpp#17617). Re-reserve the scheduler in those cases to reduce the number of unexpected graph reallocations and to prevent further reallocations later.