
upgrade NCCL dependency from 2.26.6 to 2.27.7 in easyconfigs for CuPy 13.6.0, OSU-Micro-Benchmarks 7.5, UCC-CUDA 1.3.0 using a 2025a toolchain #25249

Merged
akesandgren merged 1 commit into easybuilders:develop from lexming:20260210145211_new_pr_NCCL2277
Feb 16, 2026


@lexming (Contributor) commented Feb 10, 2026

(created using eb --new-pr)

We need a newer NCCL to install PyTorch 2.9.1 in 2025a. Since few existing easyconfigs depend on NCCL, this change should not cause much trouble.
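
For illustration, the bump in the dependent easyconfigs (CuPy, OSU-Micro-Benchmarks, UCC-CUDA) amounts to changing the NCCL version in their `dependencies` list. The sketch below uses a hypothetical helper, `bump_dependency`, which is not part of EasyBuild. The sample `dependencies` snippet and its use of `versionsuffix` are assumptions, since the dependent easyconfigs are not shown in this conversation.

```python
import re


def bump_dependency(easyconfig_text, name, old_version, new_version):
    """Replace the version of one dependency in easyconfig text.

    Hypothetical helper for illustration; returns (new_text, replacements).
    """
    # Match a tuple opening such as ('NCCL', '2.26.6' with flexible spacing.
    pattern = re.compile(
        r"(\(\s*'" + re.escape(name) + r"'\s*,\s*')" + re.escape(old_version) + r"(')"
    )
    # A function replacement avoids any escaping issues in the new version string.
    return pattern.subn(lambda m: m.group(1) + new_version + m.group(2), easyconfig_text)


# Assumed shape of a dependent easyconfig's dependency list (not taken from this PR).
sample = """dependencies = [
    ('CUDA', '12.8.0', '', SYSTEM),
    ('NCCL', '2.26.6', versionsuffix),
]
"""

updated, count = bump_dependency(sample, 'NCCL', '2.26.6', '2.27.7')
print(updated)
```

Only the tuple whose first element is `'NCCL'` is touched, so the CUDA entry in the sample stays as it is.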

@lexming lexming added the update label Feb 10, 2026
@github-actions github-actions bot added the 2025a (issues & PRs related to 2025a common toolchains) and change labels Feb 10, 2026
@github-actions

Updated software NCCL-2.27.7-GCCcore-14.2.0-CUDA-12.8.0.eb

Diff against NCCL-2.26.2-GCCcore-13.3.0-CUDA-12.6.0.eb

easybuild/easyconfigs/n/NCCL/NCCL-2.26.2-GCCcore-13.3.0-CUDA-12.6.0.eb

diff --git a/easybuild/easyconfigs/n/NCCL/NCCL-2.26.2-GCCcore-13.3.0-CUDA-12.6.0.eb b/easybuild/easyconfigs/n/NCCL/NCCL-2.27.7-GCCcore-14.2.0-CUDA-12.8.0.eb
index 5a167a50e7..2f3df755fb 100644
--- a/easybuild/easyconfigs/n/NCCL/NCCL-2.26.2-GCCcore-13.3.0-CUDA-12.6.0.eb
+++ b/easybuild/easyconfigs/n/NCCL/NCCL-2.27.7-GCCcore-14.2.0-CUDA-12.8.0.eb
@@ -1,23 +1,23 @@
 name = 'NCCL'
-version = '2.26.2'
+version = '2.27.7'
 versionsuffix = '-CUDA-%(cudaver)s'
 
 homepage = 'https://developer.nvidia.com/nccl'
 description = """The NVIDIA Collective Communications Library (NCCL) implements multi-GPU and multi-node collective
 communication primitives that are performance optimized for NVIDIA GPUs."""
 
-toolchain = {'name': 'GCCcore', 'version': '13.3.0'}
+toolchain = {'name': 'GCCcore', 'version': '14.2.0'}
 
 github_account = 'NVIDIA'
 source_urls = [GITHUB_SOURCE]
 sources = ['v%(version)s-1.tar.gz']
-checksums = ['74c6ab40c864d79c2139508e9419de5970cb406ec85f001d5f834d5f5c0c4f3b']
+checksums = ['98e6262bd55932c51e7c8ffc50cc764f019e4b94a8fd6694d839ae828ec8d128']
 
 builddependencies = [('binutils', '2.42')]
 
 dependencies = [
-    ('CUDA', '12.6.0', '', SYSTEM),
-    ('UCX-CUDA', '1.16.0', versionsuffix),
+    ('CUDA', '12.8.0', '', SYSTEM),
+    ('UCX-CUDA', '1.18.0', versionsuffix),
 ]
 
 # default CUDA compute capabilities to use (override via --cuda-compute-capabilities)
Diff against NCCL-2.27.7-GCCcore-14.3.0-CUDA-12.9.1.eb

easybuild/easyconfigs/n/NCCL/NCCL-2.27.7-GCCcore-14.3.0-CUDA-12.9.1.eb

diff --git a/easybuild/easyconfigs/n/NCCL/NCCL-2.27.7-GCCcore-14.3.0-CUDA-12.9.1.eb b/easybuild/easyconfigs/n/NCCL/NCCL-2.27.7-GCCcore-14.2.0-CUDA-12.8.0.eb
index d573781fef..2f3df755fb 100644
--- a/easybuild/easyconfigs/n/NCCL/NCCL-2.27.7-GCCcore-14.3.0-CUDA-12.9.1.eb
+++ b/easybuild/easyconfigs/n/NCCL/NCCL-2.27.7-GCCcore-14.2.0-CUDA-12.8.0.eb
@@ -6,18 +6,18 @@ homepage = 'https://developer.nvidia.com/nccl'
 description = """The NVIDIA Collective Communications Library (NCCL) implements multi-GPU and multi-node collective
 communication primitives that are performance optimized for NVIDIA GPUs."""
 
-toolchain = {'name': 'GCCcore', 'version': '14.3.0'}
+toolchain = {'name': 'GCCcore', 'version': '14.2.0'}
 
 github_account = 'NVIDIA'
 source_urls = [GITHUB_SOURCE]
 sources = ['v%(version)s-1.tar.gz']
 checksums = ['98e6262bd55932c51e7c8ffc50cc764f019e4b94a8fd6694d839ae828ec8d128']
 
-builddependencies = [('binutils', '2.44')]
+builddependencies = [('binutils', '2.42')]
 
 dependencies = [
-    ('CUDA', '12.9.1', '', SYSTEM),
-    ('UCX-CUDA', '1.19.0', versionsuffix),
+    ('CUDA', '12.8.0', '', SYSTEM),
+    ('UCX-CUDA', '1.18.0', versionsuffix),
 ]
 
 # default CUDA compute capabilities to use (override via --cuda-compute-capabilities)
Diff against NCCL-2.26.6-GCCcore-14.2.0-CUDA-12.8.0.eb

easybuild/easyconfigs/n/NCCL/NCCL-2.26.6-GCCcore-14.2.0-CUDA-12.8.0.eb

diff --git a/easybuild/easyconfigs/n/NCCL/NCCL-2.26.6-GCCcore-14.2.0-CUDA-12.8.0.eb b/easybuild/easyconfigs/n/NCCL/NCCL-2.27.7-GCCcore-14.2.0-CUDA-12.8.0.eb
index 31a072cbdd..2f3df755fb 100644
--- a/easybuild/easyconfigs/n/NCCL/NCCL-2.26.6-GCCcore-14.2.0-CUDA-12.8.0.eb
+++ b/easybuild/easyconfigs/n/NCCL/NCCL-2.27.7-GCCcore-14.2.0-CUDA-12.8.0.eb
@@ -1,5 +1,5 @@
 name = 'NCCL'
-version = '2.26.6'
+version = '2.27.7'
 versionsuffix = '-CUDA-%(cudaver)s'
 
 homepage = 'https://developer.nvidia.com/nccl'
@@ -11,7 +11,7 @@ toolchain = {'name': 'GCCcore', 'version': '14.2.0'}
 github_account = 'NVIDIA'
 source_urls = [GITHUB_SOURCE]
 sources = ['v%(version)s-1.tar.gz']
-checksums = ['2a4f86198510e1f0764c116b33ff70e082240f87d158b2017d7f34c7c3768ac6']
+checksums = ['98e6262bd55932c51e7c8ffc50cc764f019e4b94a8fd6694d839ae828ec8d128']
 
 builddependencies = [('binutils', '2.42')]
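
As a reading aid for the diffs above, here is a rough sketch of how the values in the new easyconfig combine into its filename and source tarball name. It uses plain Python `%` formatting to mimic EasyBuild's template resolution, which is a simplification, and the `easyconfig_filename` helper is hypothetical. The naming pattern is inferred from the filenames in this PR and assumes a non-system toolchain.

```python
def easyconfig_filename(name, version, toolchain, versionsuffix=''):
    """Build '<name>-<version>-<tcname>-<tcversion><versionsuffix>.eb'."""
    return '%s-%s-%s-%s%s.eb' % (
        name, version, toolchain['name'], toolchain['version'], versionsuffix
    )


# Values taken from the new easyconfig in this PR.
name = 'NCCL'
version = '2.27.7'
toolchain = {'name': 'GCCcore', 'version': '14.2.0'}

# EasyBuild derives %(cudaver)s from the CUDA dependency; here it is set by hand.
templates = {'version': version, 'cudaver': '12.8.0'}

versionsuffix = '-CUDA-%(cudaver)s' % templates
source = 'v%(version)s-1.tar.gz' % templates
filename = easyconfig_filename(name, version, toolchain, versionsuffix)

print(filename)
print(source)
```

The printed filename should match `NCCL-2.27.7-GCCcore-14.2.0-CUDA-12.8.0.eb`, the file this PR adds, and the source name should be `v2.27.7-1.tar.gz`.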
 

@lexming (Contributor, Author) commented Feb 16, 2026

@boegelbot please test @ jsc-zen3-a100

@easybuilders easybuilders deleted a comment from boegelbot Feb 16, 2026
@easybuilders easybuilders deleted a comment from boegelbot Feb 16, 2026
@boegelbot
Collaborator

@lexming: Request for testing this PR well received on jsczen3l1.int.jsc-zen3.fz-juelich.de

PR test command 'if [[ develop != 'develop' ]]; then EB_BRANCH=develop ./easybuild_develop.sh 2> /dev/null 1>&2; EB_PREFIX=/home/boegelbot/easybuild/develop source init_env_easybuild_develop.sh; fi; EB_PR=25249 EB_ARGS= EB_CONTAINER= EB_REPO=easybuild-easyconfigs EB_BRANCH=develop /opt/software/slurm/bin/sbatch --job-name test_PR_25249 --ntasks=8 --partition=jsczen3g --gres=gpu:1 ~/boegelbot/eb_from_pr_upload_jsc-zen3.sh' executed!

  • exit code: 0
  • output:
Submitted batch job 9683

Test results coming soon (I hope)...

Details

- notification for comment with ID 3907805874 processed

Message to humans: this is just bookkeeping information for me,
it is of no use to you (unless you think I have a bug, which I don't).

@boegelbot
Collaborator

Test report by @boegelbot
SUCCESS
Build succeeded for 4 out of 4 (total: 18 mins 16 secs) (4 easyconfigs in total)
jsczen3g1.int.jsc-zen3.fz-juelich.de - Linux Rocky Linux 9.7, x86_64, AMD EPYC-Milan Processor (zen3), 1 x NVIDIA NVIDIA A100 80GB PCIe, 590.44.01, Python 3.9.23
See https://gist.github.com/boegelbot/859457d4423ccb54bf7196fc0aa7e24a for a full test report.

@akesandgren (Contributor) left a comment

LGTM

@akesandgren
Contributor

Going in, thanks @lexming!

@akesandgren akesandgren merged commit 70357f9 into easybuilders:develop Feb 16, 2026
8 checks passed
@lexming lexming deleted the 20260210145211_new_pr_NCCL2277 branch February 16, 2026 12:53
@boegel boegel changed the title from "upgrade NCCL dependency in 2025a toolchains to version 2.27" to "upgrade NCCL dependency from 2.26.6 to 2.27.7 in easyconfigs for CuPy 13.6.0, OSU-Micro-Benchmarks 7.5, UCC-CUDA 1.3.0 using a 2025a toolchain" Feb 20, 2026

Labels

2025a (issues & PRs related to 2025a common toolchains), change, update
