UPSTREAM PR #18789: server: improve slots scheduling for n_cmpl by loci-dev · Pull Request #928 · auroralabs-loci/llama.cpp

loci-dev · 2026-01-15T10:41:11Z

This PR introduces scheduling mechanism inspired by thread barrier, which allow launching n_cmpl slots at the same time.

I tested with repeated requests to /v1/completions using the following payload:

{
    "prompt": "I believe the meaning of life is",
    "stream": false,
    "n": 3,
    "n_predict": 100,
    "id_slot": 0
}

And so far it works correctly

loci-review · 2026-01-15T11:40:53Z

Explore the complete analysis inside the Version Insights

Based on the analysis, no functions were identified with meaningful performance changes between the base and target versions. The code modifications did not result in measurable performance impact.

loci-review · 2026-01-15T12:21:23Z

Explore the complete analysis inside the Version Insights

Based on the analysis, no functions were identified with meaningful performance changes between the base and target versions. The code modifications did not result in measurable performance impact.

ngxson added 11 commits January 12, 2026 17:41

server : make sure children tasks are scheduled to launch with parent

b55964a

fix

e32545f

add comment pointing to this PR

821e329

fix

25702ba

clean up

9481b9d

more debug messages

f0349e4

add pop_deferred_task with specific ID version

da6e2ba

improve the logic

d6b0d23

simple approach

ba86ad9

no double move

79c1967

Merge branch 'master' into xsn/n_cmpl_sync_barrier

d5505b1

loci-dev temporarily deployed to PROD__AL_DEMO January 15, 2026 10:41 — with GitHub Actions Inactive

correct return type of launch_slots_with_parent_task

8b5474a

loci-dev temporarily deployed to PROD__AL_DEMO January 15, 2026 11:36 — with GitHub Actions Inactive

loci-dev force-pushed the main branch 14 times, most recently from 74ffea9 to 839190f Compare January 18, 2026 00:41

loci-dev force-pushed the main branch 30 times, most recently from 0da3c3b to 90caac4 Compare January 27, 2026 03:40

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

UPSTREAM PR #18789: server: improve slots scheduling for n_cmpl#928

UPSTREAM PR #18789: server: improve slots scheduling for n_cmpl#928
loci-dev wants to merge 12 commits intomainfrom
upstream-PR18789-branch_ngxson-xsn/n_cmpl_sync_barrier

loci-dev commented Jan 15, 2026

Uh oh!

loci-review bot commented Jan 15, 2026

Uh oh!

loci-review bot commented Jan 15, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

loci-dev commented Jan 15, 2026

Uh oh!

loci-review bot commented Jan 15, 2026

Uh oh!

loci-review bot commented Jan 15, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants