Hangs were reported on Jetson Orin AGX when CUDA_SCALE_LAUNCH_QUEUES=4x is set. This reverts the previous PR (#19042) and instead updates the documentation to suggest that users consider setting CUDA_SCALE_LAUNCH_QUEUES=4x for faster throughput on multi-GPU systems.
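For reference, after this revert the setting is only documented, not enabled by default. A user on a multi-GPU system who wants to experiment with it would export the variable before launching a llama.cpp binary (e.g. `CUDA_SCALE_LAUNCH_QUEUES=4x ./llama-bench ...`), or, when embedding the library, set it in the process environment before the CUDA backend is initialized. The sketch below shows the latter; the assumption that the driver reads the variable at first CUDA initialization, and the use of POSIX `setenv`, are not taken from the PR itself.

```cpp
// Minimal sketch (POSIX), assuming CUDA_SCALE_LAUNCH_QUEUES is read from the
// environment when the first CUDA context is created, so it has to be set
// before any CUDA/ggml initialization happens in the process.
#include <cstdlib>
#include <cstdio>

int main() {
    // Opt in to the larger launch queues discussed in the PR; "4x" is the
    // value mentioned there. The final argument (1) overwrites any value the
    // caller may already have exported.
    setenv("CUDA_SCALE_LAUNCH_QUEUES", "4x", 1);

    // ... initialize the llama.cpp / ggml CUDA backend here ...

    std::printf("CUDA_SCALE_LAUNCH_QUEUES=%s\n",
                std::getenv("CUDA_SCALE_LAUNCH_QUEUES"));
    return 0;
}
```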
No meaningful performance changes were detected across 115327 analyzed functions in the following binaries: build.bin.libllama.so, build.bin.llama-tts, build.bin.llama-cvector-generator, build.bin.libmtmd.so, build.bin.libggml.so, build.bin.libggml-cpu.so, build.bin.libggml-base.so, build.bin.llama-bench, build.bin.llama-gemma3-cli, build.bin.llama-gguf-split, build.bin.llama-llava-cli, build.bin.llama-minicpmv-cli, build.bin.llama-quantize, build.bin.llama-tokenize, build.bin.llama-qwen2vl-cli. Full breakdown: Loci Inspector.
Force-pushed from 237828b to b128b33
Force-pushed from cd152fa to ab12294
Note
Source pull request: ggml-org/llama.cpp#19227
Hangs were reported on Jetson Orin AGX when CUDA_SCALE_LAUNCH_QUEUES=4x is set (Issue #19219). This reverts the previous PR (#19042) and instead updates the documentation to suggest that users consider setting CUDA_SCALE_LAUNCH_QUEUES=4x for faster throughput on multi-GPU systems.