feat: add Vulkan REPEAT op support for f16 to f16. by l8bloom · Pull Request #23298 · ggml-org/llama.cpp

l8bloom · 2026-05-18T21:51:26Z

Overview

Add Vulkan REPEAT op support for f16 to f16.

(Please advise if the PR is redundant and/or missing steps to full implementation)

Additional information

Getting:

[INFO ] stable-diffusion.cpp:4777 - sampling completed, taking 277.99s
[DEBUG] ggml_extend.hpp:1904 - ltx_audio_vae compute buffer size: 90.68 MB(VRAM)
[INFO ] ggml_extend.hpp:2142 - ltx_audio_vae offload params (339.87 MB, 1285 tensors) to runtime backend (Vulkan0), taking 0.08s
[ERROR] ggml_vulkan: Error: Missing op: REPEAT for f16 to f16 ~/Projects/local_ai/stable-diffusion.cpp/ggml/src/ggml-vulkan/ggml-vulkan.cpp:9968: fatal error
[New LWP 233602]
[New LWP 233600]
[New LWP 233586]
[New LWP 233471]
[New LWP 233470]
[New LWP 233469]
[New LWP 233468]

while running video generation(stable-diffusion.cpp - relies on ggml) with the following models:

Diffusion Model: ltx-2.3-22b-dev-UD-Q4_K_M.gguf
Video VAE: ltx-2.3-22b-dev_video_vae.safetensors
Audio VAE: ltx-2.3-22b-dev_audio_vae.safetensors

audio-vae encoding works fine on CPU, but breaks with the message above on Linux-Vulkan

Reusing pipeline_repeat_f32 worked fine, but looks dubious.

Requirements

I have read and agree with the contributing guidelines
AI usage disclosure: No AI

l8bloom · 2026-05-25T19:05:15Z

Hi, @jeffbolznv can I help with the failing CI actions?
ubuntu-cpu-riscv64-native and ubuntu-cpu (x64, ubuntu-22.04) look like could pass if re-run.

jeffbolznv · 2026-05-25T20:26:55Z

They're probably unrelated, I wouldn't worry about them.

0cc4m · 2026-05-27T13:34:40Z

@jeffbolznv can you approve as well?

* origin/master: hexagon: add support for Q4_1 in MUL_MAT and MUL_MAT_ID (ggml-org#23647) ggml-webgpu: Fix how to dispatch WG to some ops (ggml-org#23750) vulkan: Switch MUL_MAT_VEC to 4 K per iteration for F16/32 (ggml-org#22887) vulkan: use GL_NV_cooperative_matrix_decode_vector for faster matmul (ggml-org#23541) vulkan: add REPEAT op support for f16 to f16. (ggml-org#23298) ci : move ARM jobs to self-hosted + disable kleidiai mac release (ggml-org#23780) vendor : update cpp-httplib to 0.46.0 (ggml-org#23650) pyproject : add conversion folder and update dependencies (ggml-org#23746) CUDA: restrict PDL to CTK >= 12.3 due to MSVC issues (ggml-org#23742) ci : bump cuda release to 13.3 (ggml-org#23749) common : fix env names to all have LLAMA_ARG_ prefix (ggml-org#23778) ci : fix windows ccaches (ggml-org#23777) ci : remove wasm test (ggml-org#23733) vulkan: avoid preferring transfer queue on AMD UMA devices (ggml-org#22455) ci : add ccache to server builds + fix undefined sanitizer build (ggml-org#23763) docs : fix duplicated "the" in granitevision and model-conversion docs (ggml-org#23767) convert: add MiniCPM5 tokenizer support (ggml-org#23384) server : fix the log message when using SSL (ggml-org#23393)

* feat: extend repeat op for vulkan * feat: add repeat_f16 vulkan pipeline * fix: ensure same dst and src types * fix: use type_size instead of data types * fix: use int16 and int32 for repeat shader op * chore: rename repeat_f* to repeat_i* * chore: rename repeat vulkan pipelines

feat: extend repeat op for vulkan

70c8d5f

l8bloom requested a review from a team as a code owner May 18, 2026 21:51

github-actions Bot added Vulkan Issues specific to the Vulkan backend ggml changes relating to the ggml tensor library for machine learning labels May 18, 2026

jeffbolznv requested changes May 18, 2026

View reviewed changes

Comment thread ggml/src/ggml-vulkan/ggml-vulkan.cpp Outdated

Comment thread ggml/src/ggml-vulkan/ggml-vulkan.cpp Outdated

l8bloom added 2 commits May 19, 2026 12:58

feat: add repeat_f16 vulkan pipeline

36c6088

fix: ensure same dst and src types

0fe360b

0cc4m reviewed May 19, 2026

View reviewed changes

Comment thread ggml/src/ggml-vulkan/ggml-vulkan.cpp Outdated

jeffbolznv reviewed May 19, 2026

View reviewed changes

Comment thread ggml/src/ggml-vulkan/ggml-vulkan.cpp Outdated

l8bloom added 2 commits May 19, 2026 17:38

fix: use type_size instead of data types

81d29f4

fix: use int16 and int32 for repeat shader op

7c846a3

jeffbolznv reviewed May 20, 2026

View reviewed changes

Comment thread ggml/src/ggml-vulkan/vulkan-shaders/vulkan-shaders-gen.cpp Outdated

l8bloom added 2 commits May 20, 2026 16:23

chore: rename repeat_f* to repeat_i*

f93e281

chore: rename repeat vulkan pipelines

b5e2932

l8bloom requested a review from jeffbolznv May 25, 2026 19:05

0cc4m approved these changes May 27, 2026

View reviewed changes

jeffbolznv approved these changes May 27, 2026

View reviewed changes

0cc4m merged commit 837bb6b into ggml-org:master May 27, 2026
47 of 50 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add Vulkan REPEAT op support for f16 to f16.#23298

feat: add Vulkan REPEAT op support for f16 to f16.#23298
0cc4m merged 7 commits into
ggml-org:masterfrom
l8bloom:feat/add-repeat-op-for-f16-to-f16

l8bloom commented May 18, 2026 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

l8bloom commented May 25, 2026

Uh oh!

jeffbolznv commented May 25, 2026

Uh oh!

0cc4m commented May 27, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

l8bloom commented May 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Overview

Additional information

Requirements

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

l8bloom commented May 25, 2026

Uh oh!

jeffbolznv commented May 25, 2026

Uh oh!

0cc4m commented May 27, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

l8bloom commented May 18, 2026 •

edited

Loading