Skip to content

{perf}[GCC/13.3.0] kineto v20250715 w/ CUDA 12.6.0#25146

Merged
boegel merged 7 commits intoeasybuilders:developfrom
pavelToman:20260126105429_new_pr_kineto20250715
Feb 12, 2026
Merged

{perf}[GCC/13.3.0] kineto v20250715 w/ CUDA 12.6.0#25146
boegel merged 7 commits intoeasybuilders:developfrom
pavelToman:20260126105429_new_pr_kineto20250715

Conversation

@pavelToman
Copy link
Collaborator

@pavelToman pavelToman commented Jan 26, 2026

(created using eb --new-pr)
extracted from RP: #23436
for: #24328
resolves vscentrum/vsc-software-stack#584

@github-actions github-actions bot added the 2024a issues & PRs related to 2024a common toolchains label Jan 26, 2026
@github-actions
Copy link

github-actions bot commented Jan 26, 2026

Updated software kineto-20250715-GCC-13.3.0-CUDA-12.6.0.eb

Diff against kineto-0.4.0-GCC-12.3.0.eb

easybuild/easyconfigs/k/kineto/kineto-0.4.0-GCC-12.3.0.eb

diff --git a/easybuild/easyconfigs/k/kineto/kineto-0.4.0-GCC-12.3.0.eb b/easybuild/easyconfigs/k/kineto/kineto-20250715-GCC-13.3.0-CUDA-12.6.0.eb
index 8bbd957c43..ea9f20e466 100644
--- a/easybuild/easyconfigs/k/kineto/kineto-0.4.0-GCC-12.3.0.eb
+++ b/easybuild/easyconfigs/k/kineto/kineto-20250715-GCC-13.3.0-CUDA-12.6.0.eb
@@ -1,29 +1,32 @@
 easyblock = 'CMakeMake'
 
 name = 'kineto'
-version = '0.4.0'
+version = '20250715'
+local_commit = '16e2a56'
+versionsuffix = '-CUDA-%(cudaver)s'
 
 homepage = 'https://github.com/pytorch/kineto'
 description = "A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters"
 
-toolchain = {'name': 'GCC', 'version': '12.3.0'}
+toolchain = {'name': 'GCC', 'version': '13.3.0'}
 
 source_urls = ['https://github.com/pytorch/kineto/archive/']
 sources = [{
     'git_config': {
         'url': 'https://github.com/pytorch',
         'repo_name': name,
-        'tag': 'v%(version)s',
+        'commit': local_commit,
         'recursive': True,
     },
     'filename': SOURCE_TAR_XZ,
 }]
-checksums = ['5f1c744d57fdc40878b0ff87400097b04ed3027757bcccf6d4f8ceecc8e29855']
+checksums = ['bc44d710104b879f6982ffd4d79a8ba6c97c8aa5fefbcfd85c317d541ed1094d']
 
 builddependencies = [
-    ('CMake', '3.26.3'),
-    ('Python', '3.11.3'),
+    ('CMake', '3.29.3'),
+    ('Python', '3.12.3'),
 ]
+dependencies = [('CUDA', '12.6.0', '', SYSTEM)]
 
 start_dir = 'libkineto'
 
Diff against kineto-0.4.0-GCC-11.3.0.eb

easybuild/easyconfigs/k/kineto/kineto-0.4.0-GCC-11.3.0.eb

diff --git a/easybuild/easyconfigs/k/kineto/kineto-0.4.0-GCC-11.3.0.eb b/easybuild/easyconfigs/k/kineto/kineto-20250715-GCC-13.3.0-CUDA-12.6.0.eb
index 37a633e728..ea9f20e466 100644
--- a/easybuild/easyconfigs/k/kineto/kineto-0.4.0-GCC-11.3.0.eb
+++ b/easybuild/easyconfigs/k/kineto/kineto-20250715-GCC-13.3.0-CUDA-12.6.0.eb
@@ -1,29 +1,32 @@
 easyblock = 'CMakeMake'
 
 name = 'kineto'
-version = '0.4.0'
+version = '20250715'
+local_commit = '16e2a56'
+versionsuffix = '-CUDA-%(cudaver)s'
 
 homepage = 'https://github.com/pytorch/kineto'
 description = "A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters"
 
-toolchain = {'name': 'GCC', 'version': '11.3.0'}
+toolchain = {'name': 'GCC', 'version': '13.3.0'}
 
 source_urls = ['https://github.com/pytorch/kineto/archive/']
 sources = [{
     'git_config': {
         'url': 'https://github.com/pytorch',
         'repo_name': name,
-        'tag': 'v%(version)s',
+        'commit': local_commit,
         'recursive': True,
     },
     'filename': SOURCE_TAR_XZ,
 }]
-checksums = ['5f1c744d57fdc40878b0ff87400097b04ed3027757bcccf6d4f8ceecc8e29855']
+checksums = ['bc44d710104b879f6982ffd4d79a8ba6c97c8aa5fefbcfd85c317d541ed1094d']
 
 builddependencies = [
-    ('CMake', '3.24.3'),
-    ('Python', '3.10.4', '-bare'),
+    ('CMake', '3.29.3'),
+    ('Python', '3.12.3'),
 ]
+dependencies = [('CUDA', '12.6.0', '', SYSTEM)]
 
 start_dir = 'libkineto'
 

@pavelToman
Copy link
Collaborator Author

@boegelbot please test @ jsc-zen3

@boegelbot
Copy link
Collaborator

@pavelToman: Request for testing this PR well received on jsczen3l1.int.jsc-zen3.fz-juelich.de

PR test command 'if [[ develop != 'develop' ]]; then EB_BRANCH=develop ./easybuild_develop.sh 2> /dev/null 1>&2; EB_PREFIX=/home/boegelbot/easybuild/develop source init_env_easybuild_develop.sh; fi; EB_PR=25146 EB_ARGS= EB_CONTAINER= EB_REPO=easybuild-easyconfigs EB_BRANCH=develop /opt/software/slurm/bin/sbatch --job-name test_PR_25146 --ntasks=8 ~/boegelbot/eb_from_pr_upload_jsc-zen3.sh' executed!

  • exit code: 0
  • output:
Submitted batch job 9488

Test results coming soon (I hope)...

Details

- notification for comment with ID 3798735263 processed

Message to humans: this is just bookkeeping information for me,
it is of no use to you (unless you think I have a bug, which I don't).

@boegelbot
Copy link
Collaborator

Test report by @boegelbot
SUCCESS
Build succeeded for 1 out of 1 (total: 2 mins 50 secs) (1 easyconfigs in total)
jsczen3c1.int.jsc-zen3.fz-juelich.de - Linux Rocky Linux 9.7, x86_64, AMD EPYC-Milan Processor (zen3), Python 3.9.23
See https://gist.github.com/boegelbot/5d372905dca6848b62d7bfe5760b4537 for a full test report.

@laraPPr
Copy link
Contributor

laraPPr commented Feb 4, 2026

Test report by @laraPPr
SUCCESS
Build succeeded for 1 out of 1 (total: 1 min 49 secs) (1 easyconfigs in total)
node4304.litleo.os - Linux RHEL 9.6 (Plow), x86_64, AMD EPYC 9454P 48-Core Processor, 1 x NVIDIA NVIDIA H100 NVL, 580.95.05, Python 3.9.21
See https://gist.github.com/laraPPr/06b0c164230457783d697df0225e8356 for a full test report.

laraPPr
laraPPr previously requested changes Feb 4, 2026
pavelToman and others added 2 commits February 9, 2026 12:35
Co-authored-by: Lara Ramona Peeters <49882639+laraPPr@users.noreply.github.com>
@pavelToman
Copy link
Collaborator Author

@boegelbot please test @ jsc-zen3-a100

@boegelbot
Copy link
Collaborator

@pavelToman: Request for testing this PR well received on jsczen3l1.int.jsc-zen3.fz-juelich.de

PR test command 'if [[ develop != 'develop' ]]; then EB_BRANCH=develop ./easybuild_develop.sh 2> /dev/null 1>&2; EB_PREFIX=/home/boegelbot/easybuild/develop source init_env_easybuild_develop.sh; fi; EB_PR=25146 EB_ARGS= EB_CONTAINER= EB_REPO=easybuild-easyconfigs EB_BRANCH=develop /opt/software/slurm/bin/sbatch --job-name test_PR_25146 --ntasks=8 --partition=jsczen3g --gres=gpu:1 ~/boegelbot/eb_from_pr_upload_jsc-zen3.sh' executed!

  • exit code: 0
  • output:
Submitted batch job 9610

Test results coming soon (I hope)...

Details

- notification for comment with ID 3871216402 processed

Message to humans: this is just bookkeeping information for me,
it is of no use to you (unless you think I have a bug, which I don't).

@pavelToman pavelToman changed the title {perf}[GCC/13.3.0] kineto v20250715 {perf}[GCC/13.3.0] kineto v20250715 w/ CUDA 12.6.0 Feb 9, 2026
@pavelToman pavelToman changed the title {perf}[GCC/13.3.0] kineto v20250715 w/ CUDA 12.6.0 {perf}[GCC/13.3.0] kineto v20250715 Feb 9, 2026
@pavelToman
Copy link
Collaborator Author

@boegelbot please test @ jsc-zen3

@pavelToman
Copy link
Collaborator Author

Test report by @pavelToman
FAILED
Build succeeded for 0 out of 1 (total: 19 secs) (1 easyconfigs in total)
node3593.doduo.os - Linux RHEL 9.6, x86_64, AMD EPYC 7552 48-Core Processor, Python 3.9.21
See https://gist.github.com/pavelToman/de90a71c60ef2fe30c6578c80988ce7c for a full test report.

@boegelbot
Copy link
Collaborator

@pavelToman: Request for testing this PR well received on jsczen3l1.int.jsc-zen3.fz-juelich.de

PR test command 'if [[ develop != 'develop' ]]; then EB_BRANCH=develop ./easybuild_develop.sh 2> /dev/null 1>&2; EB_PREFIX=/home/boegelbot/easybuild/develop source init_env_easybuild_develop.sh; fi; EB_PR=25146 EB_ARGS= EB_CONTAINER= EB_REPO=easybuild-easyconfigs EB_BRANCH=develop /opt/software/slurm/bin/sbatch --job-name test_PR_25146 --ntasks=8 ~/boegelbot/eb_from_pr_upload_jsc-zen3.sh' executed!

  • exit code: 0
  • output:
Submitted batch job 9612

Test results coming soon (I hope)...

Details

- notification for comment with ID 3872205927 processed

Message to humans: this is just bookkeeping information for me,
it is of no use to you (unless you think I have a bug, which I don't).

@boegelbot
Copy link
Collaborator

Test report by @boegelbot
FAILED
Build succeeded for 0 out of 1 (total: 22 secs) (1 easyconfigs in total)
jsczen3c2.int.jsc-zen3.fz-juelich.de - Linux Rocky Linux 9.7, x86_64, AMD EPYC-Milan Processor (zen3), Python 3.9.23
See https://gist.github.com/boegelbot/b2efe5434189db679c2a93cb46371a78 for a full test report.

@pavelToman
Copy link
Collaborator Author

Test report by @pavelToman
FAILED
Build succeeded for 0 out of 1 (total: 20 secs) (1 easyconfigs in total)
node4211.shinx.os - Linux RHEL 9.6, x86_64, AMD EPYC 9654 96-Core Processor, Python 3.9.21
See https://gist.github.com/pavelToman/7bf4a0157ea1ca3fa85b585f8bc0f16a for a full test report.

@pavelToman
Copy link
Collaborator Author

pavelToman commented Feb 9, 2026

Without CUDA the cmake is failing with:

Failed to find nvcc.
Could not find CUPTI library
Compiler requires the CUDA toolkit.  Please set the CUDAToolkit_ROOT variable.

Also in the CMakeLists.txt there is: find_package(CUDAToolkit)


name = 'kineto'
version = '20250715'
local_commit = '16e2a56'
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@pavelToman @jpecar Is there a particular reason for this specific commit?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Most probably it was latest version at the time I worked on this ...

@boegel
Copy link
Member

boegel commented Feb 9, 2026

Without CUDA the cmake is failing with:

Failed to find nvcc.
Could not find CUPTI library
Compiler requires the CUDA toolkit.  Please set the CUDAToolkit_ROOT variable.

Also in the CMakeLists.txt there is: find_package(CUDAToolkit)

OK, let's include it then, but not as a build-only dependency.

@boegelbot
Copy link
Collaborator

Test report by @boegelbot
FAILED
Build succeeded for 0 out of 1 (total: 21 secs) (1 easyconfigs in total)
jsczen3g1.int.jsc-zen3.fz-juelich.de - Linux Rocky Linux 9.7, x86_64, AMD EPYC-Milan Processor (zen3), 1 x NVIDIA NVIDIA A100 80GB PCIe, 590.44.01, Python 3.9.23
See https://gist.github.com/boegelbot/da09f70fc4e33191e440684726238498 for a full test report.

@pavelToman
Copy link
Collaborator Author

@boegelbot please test @ jsc-zen3-a100

@boegelbot
Copy link
Collaborator

@pavelToman: Request for testing this PR well received on jsczen3l1.int.jsc-zen3.fz-juelich.de

PR test command 'if [[ develop != 'develop' ]]; then EB_BRANCH=develop ./easybuild_develop.sh 2> /dev/null 1>&2; EB_PREFIX=/home/boegelbot/easybuild/develop source init_env_easybuild_develop.sh; fi; EB_PR=25146 EB_ARGS= EB_CONTAINER= EB_REPO=easybuild-easyconfigs EB_BRANCH=develop /opt/software/slurm/bin/sbatch --job-name test_PR_25146 --ntasks=8 --partition=jsczen3g --gres=gpu:1 ~/boegelbot/eb_from_pr_upload_jsc-zen3.sh' executed!

  • exit code: 0
  • output:
Submitted batch job 9619

Test results coming soon (I hope)...

Details

- notification for comment with ID 3879016970 processed

Message to humans: this is just bookkeeping information for me,
it is of no use to you (unless you think I have a bug, which I don't).

@boegelbot
Copy link
Collaborator

Test report by @boegelbot
FAILED
Failed during parsing of the easyconfigs, so no ecs were built (1 easyconfigs in total)
jsczen3g1.int.jsc-zen3.fz-juelich.de - Linux Rocky Linux 9.7, x86_64, AMD EPYC-Milan Processor (zen3), 1 x NVIDIA NVIDIA A100 80GB PCIe, 590.44.01, Python 3.9.23
See https://gist.github.com/boegelbot/8d467a7ba363d7991aecf592181fcba7 for a full test report.

@pavelToman
Copy link
Collaborator Author

@boegelbot please test @ jsc-zen3-a100

@pavelToman
Copy link
Collaborator Author

Test report by @pavelToman
SUCCESS
Build succeeded for 1 out of 1 (total: 1 min 15 secs) (1 easyconfigs in total)
node4016.donphan.os - Linux RHEL 9.6, x86_64, Intel(R) Xeon(R) Gold 6240 CPU @ 2.60GHz, 1 x NVIDIA NVIDIA A2, 580.95.05, Python 3.9.21
See https://gist.github.com/pavelToman/19297f752ea6b813aae56d38deb06c0e for a full test report.

@pavelToman
Copy link
Collaborator Author

Test report by @pavelToman
SUCCESS
Build succeeded for 1 out of 1 (total: 1 min 8 secs) (1 easyconfigs in total)
node3309.joltik.os - Linux RHEL 9.6, x86_64, Intel(R) Xeon(R) Gold 6242 CPU @ 2.80GHz, 1 x NVIDIA Tesla V100-SXM2-32GB, 580.95.05, Python 3.9.21
See https://gist.github.com/pavelToman/680020c112b669833d9bea4d70bcbdce for a full test report.

@boegelbot
Copy link
Collaborator

@pavelToman: Request for testing this PR well received on jsczen3l1.int.jsc-zen3.fz-juelich.de

PR test command 'if [[ develop != 'develop' ]]; then EB_BRANCH=develop ./easybuild_develop.sh 2> /dev/null 1>&2; EB_PREFIX=/home/boegelbot/easybuild/develop source init_env_easybuild_develop.sh; fi; EB_PR=25146 EB_ARGS= EB_CONTAINER= EB_REPO=easybuild-easyconfigs EB_BRANCH=develop /opt/software/slurm/bin/sbatch --job-name test_PR_25146 --ntasks=8 --partition=jsczen3g --gres=gpu:1 ~/boegelbot/eb_from_pr_upload_jsc-zen3.sh' executed!

  • exit code: 0
  • output:
Submitted batch job 9620

Test results coming soon (I hope)...

Details

- notification for comment with ID 3879121972 processed

Message to humans: this is just bookkeeping information for me,
it is of no use to you (unless you think I have a bug, which I don't).

@boegelbot
Copy link
Collaborator

Test report by @boegelbot
SUCCESS
Build succeeded for 1 out of 1 (total: 1 min 20 secs) (1 easyconfigs in total)
jsczen3g1.int.jsc-zen3.fz-juelich.de - Linux Rocky Linux 9.7, x86_64, AMD EPYC-Milan Processor (zen3), 1 x NVIDIA NVIDIA A100 80GB PCIe, 590.44.01, Python 3.9.23
See https://gist.github.com/boegelbot/75f5bda0a7b183e8790ee4ad4c305170 for a full test report.

@pavelToman pavelToman changed the title {perf}[GCC/13.3.0] kineto v20250715 {perf}[GCC/13.3.0] kineto v20250715 w/ CUDA 12.6.0 Feb 10, 2026
@pavelToman
Copy link
Collaborator Author

Test report by @pavelToman
SUCCESS
Build succeeded for 1 out of 1 (total: 1 min 11 secs) (1 easyconfigs in total)
node4016.donphan.os - Linux RHEL 9.6, x86_64, Intel(R) Xeon(R) Gold 6240 CPU @ 2.60GHz, 1 x NVIDIA NVIDIA A2, 580.95.05, Python 3.9.21
See https://gist.github.com/pavelToman/7226aa74aae6e5581c4e329364fade8c for a full test report.

Copy link
Member

@boegel boegel left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm

@boegel
Copy link
Member

boegel commented Feb 12, 2026

Test report by @boegel
SUCCESS
Build succeeded for 1 out of 1 (total: 56 secs) (1 easyconfigs in total)
node3902.accelgor.os - Linux RHEL 9.6, x86_64, AMD EPYC 7413 24-Core Processor (zen3), 1 x NVIDIA NVIDIA A100-SXM4-80GB, 590.48.01, Python 3.9.21
See https://gist.github.com/boegel/d00873c7317dc757496007205d5fc679 for a full test report.

@boegel boegel dismissed laraPPr’s stale review February 12, 2026 16:40

CUDA must be a runtime dep, resolved

@boegel boegel added this to the next release (5.2.1) milestone Feb 12, 2026
@boegel
Copy link
Member

boegel commented Feb 12, 2026

Going in, thanks @pavelToman!

@boegel boegel merged commit 2a6bfde into easybuilders:develop Feb 12, 2026
8 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

2024a issues & PRs related to 2024a common toolchains update

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Dorado 1.0.2 or newer

5 participants