UPSTREAM PR #18663: server: fix n_cmpl not skipping processing prompt #864

Open

loci-dev wants to merge 10 commits into main from upstream-PR18663-branch_ngxson-xsn/fix_n_cmpl

Conversation

@loci-dev

@loci-dev loci-dev commented Jan 9, 2026

Mirrored from ggml-org/llama.cpp#18663

Ref: ggml-org/llama.cpp#17775 (comment)

When using the -v verbose log, we should now see these lines:

slot update_slots: id  0 | task 18 | prompt done, n_tokens = 8, batch.n_tokens = 8
slot update_slots: id  1 | task 21 | waiting for parent slot to complete
slot update_slots: id  2 | task 19 | waiting for parent slot to complete
slot update_slots: id  3 | task 20 | waiting for parent slot to complete

@loci-review

loci-review bot commented Jan 9, 2026

Explore the complete analysis inside the Version Insights

Perfect! I've retrieved the summary report for your project. Here are the key findings:

Summary Report for llama.cpp PR #864

Performance Analysis Results:

No significant performance changes detected - the analysis found no modified functions with performance changes greater than the 2% threshold for either:

  • Response Time
  • Throughput Time

Conclusion:

The changes in PR #864 are performance-neutral, meaning they don't introduce any performance regressions. This is a positive outcome indicating that your code modifications maintain stable performance characteristics without degrading the application's efficiency.

Would you like more detailed information about specific functions or any other aspect of this performance comparison?

@loci-review

loci-review bot commented Jan 9, 2026

Explore the complete analysis inside the Version Insights

Perfect! I've retrieved the summary report for your project. Here are the key findings:

Summary Report for llama.cpp PR #864

Key Findings:

No Significant Performance Changes Detected

The analysis indicates that no modified functions showed performance changes greater than the 2% threshold between the base and target versions.

This is a positive result, showing that your changes maintain the existing performance characteristics of the llama.cpp project.

Would you like me to help you with any additional analysis or have questions about specific aspects of this report?

- launch the parent task first so it finds the slot with best cache
- parent task waits for child tasks to be launched
- when a child task finishes - remove its cache
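
The three commit points above can be sketched in isolation. The following is a minimal, hypothetical C++ model of that parent/child slot coordination, not llama.cpp's actual server code: the names (Slot, launch_group, on_parent_prompt_done, on_child_finished) and the use of a plain vector as a stand-in for the prompt cache are all invented for illustration.

```cpp
#include <cassert>
#include <vector>

enum class SlotState { IDLE, WAITING_FOR_PARENT, PROCESSING, DONE };

struct Slot {
    int id = 0;
    SlotState state = SlotState::IDLE;
    std::vector<int> cache;    // stand-in for the slot's prompt cache
    bool is_parent = false;
};

// Launch the parent task first, so it is the one that claims the slot
// with the best cache; children are parked until the parent is done
// processing the shared prompt.
void launch_group(Slot &parent, std::vector<Slot*> &children,
                  const std::vector<int> &prompt) {
    parent.is_parent = true;
    parent.state = SlotState::PROCESSING;
    parent.cache = prompt;     // the prompt is processed exactly once
    for (Slot *c : children)
        c->state = SlotState::WAITING_FOR_PARENT;
}

// Once the parent finishes the prompt, each child copies the parent's
// cache instead of re-processing the prompt itself.
void on_parent_prompt_done(const Slot &parent, std::vector<Slot*> &children) {
    for (Slot *c : children) {
        c->cache = parent.cache;
        c->state = SlotState::PROCESSING;
    }
}

// When a child task finishes, its cache is removed so the slot does not
// pollute future best-cache lookups.
void on_child_finished(Slot &child) {
    child.cache.clear();
    child.state = SlotState::DONE;
}
```

Under this model, the "waiting for parent slot to complete" log lines shown earlier correspond to children sitting in the WAITING_FOR_PARENT state while the parent slot reports "prompt done".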
@loci-review

loci-review bot commented Jan 9, 2026

Explore the complete analysis inside the Version Insights

Perfect! I've retrieved the summary report for your project. Here are the key findings:

Summary Report for llama.cpp PR #864

Key Finding: No Significant Performance Impact

The analysis found that no modified functions showed performance changes greater than 2% in either:

  • Response Time (execution time per function call)
  • Throughput Time (time including callees)

What This Means:

This is a positive result indicating that your code changes are safe from a performance perspective and can proceed without concerns about performance degradation.

Would you like more detailed information about specific functions or any other aspect of this analysis?

@loci-review

loci-review bot commented Jan 9, 2026

Explore the complete analysis inside the Version Insights

Perfect! I've retrieved the summary report for your project. Here are the key findings:

Summary Report for llama.cpp PR #864

Performance Analysis Results:

No Significant Performance Impact Detected

The analysis found that:

  • No modified functions showed performance changes greater than the 2% threshold
  • Both Response Time and Throughput Time metrics remained stable
  • All performance variations are within acceptable limits (±2%)

Conclusion:

This pull request appears safe to merge from a performance perspective, as it maintains stable performance characteristics without introducing any regressions to the llama.cpp codebase.

Would you like more detailed information about specific functions or any other aspect of this performance comparison?

@loci-dev loci-dev force-pushed the main branch 10 times, most recently from 5dbcd6b to b63d258 on January 10, 2026 at 18:11
@loci-dev loci-dev force-pushed the main branch 30 times, most recently from b96fcb2 to 9e5f0e1 on January 19, 2026 at 23:09
