Skip to content

allow f suffix in CUDA compute capabilities#5067

Merged
boegel merged 4 commits intoeasybuilders:developfrom
bedroge:cuda_cc_regex_allow_f
Dec 9, 2025
Merged

allow f suffix in CUDA compute capabilities#5067
boegel merged 4 commits intoeasybuilders:developfrom
bedroge:cuda_cc_regex_allow_f

Conversation

@bedroge
Copy link
Contributor

@bedroge bedroge commented Dec 9, 2025

CUDA compute capabilities can now also have an f suffix, see https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#feature-set-compiler-targets.

Note that it's also changed in the regex that is used to extract the CUDA architectures from binaries. I wasn't sure if the f suffix would show up there as well, but I tested that by compiling something manually (with -arch compute_100f) and running cuobjdump on it:

Fatbin ptx code:
================
arch = sm_100
code version = [8,8]
host = linux
compile_size = 64bit
compressed
ptxasOptions = 

Fatbin ptx code:
================
arch = sm_100f
code version = [8,8]
host = linux
compile_size = 64bit
compressed
ptxasOptions = 

So, it does indeed have sm_100f.

Copy link
Member

@boegel boegel left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm

@boegel boegel added this to the next release (5.2.0) milestone Dec 9, 2025
Copy link
Member

@boegel boegel left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm

@boegel boegel merged commit 8814af9 into easybuilders:develop Dec 9, 2025
40 checks passed
@bedroge bedroge deleted the cuda_cc_regex_allow_f branch December 9, 2025 14:24
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants

Comments