Skip to content

ci : bump cuda release to 13.3#23749

Merged
ggerganov merged 1 commit into
masterfrom
cisc/ci-release-cuda-13-3
May 27, 2026
Merged

ci : bump cuda release to 13.3#23749
ggerganov merged 1 commit into
masterfrom
cisc/ci-release-cuda-13-3

Conversation

@CISC
Copy link
Copy Markdown
Member

@CISC CISC commented May 27, 2026

Overview

Bump Windows CUDA 13 release to 13.3

Additional information

Test run: https://github.com/CISC/llama.cpp/actions/runs/26483256606

Requirements

@CISC CISC requested a review from a team as a code owner May 27, 2026 01:03
@github-actions github-actions Bot added the devops improvements to build systems and github actions label May 27, 2026
Copy link
Copy Markdown
Member

@ggerganov ggerganov left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We keep the CUDA 12.4 release for some compatibility with older hardware - is that correct?

@ggerganov ggerganov added the merge ready A maintainer can use this label to indicate that they consider the changes final and ready to merge. label May 27, 2026
@CISC
Copy link
Copy Markdown
Member Author

CISC commented May 27, 2026

We keep the CUDA 12.4 release for some compatibility with older hardware - is that correct?

Correct.

@ggerganov ggerganov merged commit 2d0656f into master May 27, 2026
3 checks passed
@ggerganov ggerganov deleted the cisc/ci-release-cuda-13-3 branch May 27, 2026 12:06
gabe-l-hart added a commit to gabe-l-hart/llama.cpp that referenced this pull request May 27, 2026
* origin/master:
hexagon: add support for Q4_1 in MUL_MAT and MUL_MAT_ID (ggml-org#23647)
ggml-webgpu: Fix how to dispatch WG to some ops (ggml-org#23750)
vulkan: Switch MUL_MAT_VEC to 4 K per iteration for F16/32 (ggml-org#22887)
vulkan: use GL_NV_cooperative_matrix_decode_vector for faster matmul (ggml-org#23541)
vulkan: add REPEAT op support for f16 to f16. (ggml-org#23298)
ci : move ARM jobs to self-hosted + disable kleidiai mac release (ggml-org#23780)
vendor : update cpp-httplib to 0.46.0 (ggml-org#23650)
pyproject : add conversion folder and update dependencies (ggml-org#23746)
CUDA: restrict PDL to CTK >= 12.3 due to MSVC issues (ggml-org#23742)
ci : bump cuda release to 13.3 (ggml-org#23749)
common : fix env names to all have LLAMA_ARG_ prefix (ggml-org#23778)
ci : fix windows ccaches (ggml-org#23777)
ci : remove wasm test (ggml-org#23733)
vulkan: avoid preferring transfer queue on AMD UMA devices (ggml-org#22455)
ci : add ccache to server builds + fix undefined sanitizer build (ggml-org#23763)
docs : fix duplicated "the" in granitevision and model-conversion docs (ggml-org#23767)
convert: add MiniCPM5 tokenizer support (ggml-org#23384)
server : fix the log message when using SSL (ggml-org#23393)
fewtarius pushed a commit to fewtarius/llama.cpp that referenced this pull request May 30, 2026
turbo-tan pushed a commit to turbo-tan/llama.cpp-tq3 that referenced this pull request Jun 2, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

devops improvements to build systems and github actions merge ready A maintainer can use this label to indicate that they consider the changes final and ready to merge.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants