ci : bump cuda release to 13.3 by CISC · Pull Request #23749 · ggml-org/llama.cpp

CISC · 2026-05-27T01:03:07Z

Overview

Bump Windows CUDA 13 release to 13.3

Additional information

Test run: https://github.com/CISC/llama.cpp/actions/runs/26483256606

Requirements

I have read and agree with the contributing guidelines
AI usage disclosure: ala

ggerganov

We keep the CUDA 12.4 release for some compatibility with older hardware - is that correct?

CISC · 2026-05-27T11:54:03Z

We keep the CUDA 12.4 release for some compatibility with older hardware - is that correct?

Correct.

* origin/master: hexagon: add support for Q4_1 in MUL_MAT and MUL_MAT_ID (ggml-org#23647) ggml-webgpu: Fix how to dispatch WG to some ops (ggml-org#23750) vulkan: Switch MUL_MAT_VEC to 4 K per iteration for F16/32 (ggml-org#22887) vulkan: use GL_NV_cooperative_matrix_decode_vector for faster matmul (ggml-org#23541) vulkan: add REPEAT op support for f16 to f16. (ggml-org#23298) ci : move ARM jobs to self-hosted + disable kleidiai mac release (ggml-org#23780) vendor : update cpp-httplib to 0.46.0 (ggml-org#23650) pyproject : add conversion folder and update dependencies (ggml-org#23746) CUDA: restrict PDL to CTK >= 12.3 due to MSVC issues (ggml-org#23742) ci : bump cuda release to 13.3 (ggml-org#23749) common : fix env names to all have LLAMA_ARG_ prefix (ggml-org#23778) ci : fix windows ccaches (ggml-org#23777) ci : remove wasm test (ggml-org#23733) vulkan: avoid preferring transfer queue on AMD UMA devices (ggml-org#22455) ci : add ccache to server builds + fix undefined sanitizer build (ggml-org#23763) docs : fix duplicated "the" in granitevision and model-conversion docs (ggml-org#23767) convert: add MiniCPM5 tokenizer support (ggml-org#23384) server : fix the log message when using SSL (ggml-org#23393)

bump cuda release to 13.3

c7d7c22

CISC requested a review from a team as a code owner May 27, 2026 01:03

github-actions Bot added the devops improvements to build systems and github actions label May 27, 2026

ggerganov approved these changes May 27, 2026

View reviewed changes

ggerganov added the merge ready A maintainer can use this label to indicate that they consider the changes final and ready to merge. label May 27, 2026

ggerganov merged commit 2d0656f into master May 27, 2026
3 checks passed

ggerganov deleted the cisc/ci-release-cuda-13-3 branch May 27, 2026 12:06

fewtarius pushed a commit to fewtarius/llama.cpp that referenced this pull request May 30, 2026

ci : bump cuda release to 13.3 (ggml-org#23749)

299dbc8

turbo-tan pushed a commit to turbo-tan/llama.cpp-tq3 that referenced this pull request Jun 2, 2026

ci : bump cuda release to 13.3 (ggml-org#23749)

d5336db

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ci : bump cuda release to 13.3#23749

ci : bump cuda release to 13.3#23749
ggerganov merged 1 commit into
masterfrom
cisc/ci-release-cuda-13-3

CISC commented May 27, 2026

Uh oh!

ggerganov left a comment

Uh oh!

CISC commented May 27, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

CISC commented May 27, 2026

Overview

Additional information

Requirements

Uh oh!

ggerganov left a comment

Choose a reason for hiding this comment

Uh oh!

CISC commented May 27, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants