Make split mode graph work with vision enabled by ikawrakow · Pull Request #1392 · ikawrakow/ik_llama.cpp

ikawrakow · 2026-03-10T05:56:29Z

ubergarm · 2026-03-10T16:40:12Z

I pulled the tip of main this morning, but now with the opencode client or through the built-in web UI I'm getting gibberish inference output (from tip of main@cda15bf1):

from main@14492bfd (this PR):

I tested with and without --mmproj, but the result was the same.

I rolled back and tested each recent commit until it worked again at 666ea0e:

$ git log --oneline
cda15bf1 (HEAD -> main, upstream/main, upstream/HEAD) Discard very first compute graph for recurrent models (#1393)
f90b4c2f Full graph parallel for Qwen3.5 (dense and MoE) (#1388)
14492bfd Make split mode graph work with vision enabled (#1392) <--- first broken here
666ea0e9 Revise build instructions for ik_llama.cpp <--- working
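
The commit-by-commit rollback above can also be automated with git bisect. The following is only a toy sketch on a throwaway repository: the repository, the status.txt file, and the grep check are illustrative assumptions standing in for "rebuild and check whether the output is gibberish". None of it is part of ik_llama.cpp.

```shell
#!/bin/sh
# Toy demonstration of `git bisect run`; everything here is hypothetical scaffolding.
set -eu

repo=$(mktemp -d)
cd "$repo"
git init -q .
git config user.email "bisect@example.com"
git config user.name "bisect demo"

# Four commits: the first one is good, the second introduces the "regression".
i=0
for state in ok broken broken broken; do
  i=$((i + 1))
  echo "$state" > status.txt
  echo "$i" >> history.txt          # guarantees every commit has a change
  git add status.txt history.txt
  git commit -q -m "commit $i ($state)"
done

good=$(git rev-list --max-parents=0 HEAD)             # root commit, known good
expected=$(git rev-list --reverse HEAD | sed -n 2p)   # second commit

# Mark HEAD as bad and the root as good, then let git drive the search.
# The test command must exit 0 for a good commit and non-zero for a bad one.
git bisect start HEAD "$good" > /dev/null
git bisect run sh -c 'grep -qx ok status.txt' > /dev/null

first_bad=$(git rev-parse refs/bisect/bad)
echo "first bad commit: $(git log -1 --format=%s "$first_bad")"

git bisect reset > /dev/null 2>&1
```

In a real checkout the same idea would be `git bisect start <bad> <good>` with the hashes from the log above, followed by a manual rebuild and `git bisect good` or `git bisect bad` at each step, since judging gibberish output is hard to script.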

Here is my command:

# full offload on 2x RTX A6000 48GB VRAM each
./build/bin/llama-server \
  --alias Qwen3.5-122B-A10B \
  --model "$model" \
  -fa on \
  -c 262144 \
  -sm graph \
  -ngl 99 \
  -ub 4096 -b 4096 \
  --parallel 1 \
  --threads 1 \
  --host 127.0.0.1 \
  --port 8080 \
  --jinja \
  --no-mmap

Not sure if anyone else is seeing this? I'll keep testing to see if I can narrow it down any more.

ubergarm · 2026-03-10T17:12:45Z

Removing --no-mmap fixes the issue as discussed in the linked PR ☝️

I tested and --mmproj is working and reading images correctly!

ikawrakow · 2026-03-10T17:31:27Z

@ubergarm

#1397 should fix the --no-mmap issue.

Make split mode graph work with vision enabled

91ccfb7

ikawrakow merged commit 14492bf into main Mar 10, 2026

ikawrakow mentioned this pull request Mar 10, 2026

Two bugs causing crashes with Qwen3-VL-30B-A3B (MoE vision model): CUDA OOB + n_embd_inp overcounting #1384

Closed

ubergarm mentioned this pull request Mar 10, 2026

Full graph parallel for Qwen3.5 (dense and MoE) #1388

Merged

ikawrakow mentioned this pull request Mar 10, 2026

Argghh #1397

Merged

ikawrakow merged 1 commit into main from ik/fix_sm_graph_with_vision
