Skip to content

Update TensorFlow easyblock for v2.18.x#3699

Merged
boegel merged 13 commits intoeasybuilders:developfrom
pavelToman:patch-2
May 21, 2025
Merged

Update TensorFlow easyblock for v2.18.x#3699
boegel merged 13 commits intoeasybuilders:developfrom
pavelToman:patch-2

Conversation

@pavelToman
Copy link
Contributor

@pavelToman pavelToman commented Apr 15, 2025

@boegel boegel changed the title Update tensorflow.py for v2.17.1 Update TensorFlow easyblock for for v2.17.1 Apr 23, 2025
@boegel boegel added the update label Apr 23, 2025
@boegel boegel added this to the release after 5.0.1 milestone Apr 23, 2025
@pavelToman pavelToman changed the title Update TensorFlow easyblock for for v2.17.1 Update TensorFlow easyblock for v2.17.1 Apr 24, 2025
@pavelToman pavelToman changed the title Update TensorFlow easyblock for v2.17.1 Update TensorFlow easyblock for v2.18.1 May 6, 2025
@pavelToman
Copy link
Contributor Author

@boegelbot please test @ jsc-zen3
EB_ARGS="TensorFlow-2.15.1-foss-2023a.eb TensorFlow-2.13.0-foss-2023a.eb"

@boegelbot
Copy link

@pavelToman: Request for testing this PR well received on jsczen3l1.int.jsc-zen3.fz-juelich.de

PR test command 'if [[ develop != 'develop' ]]; then EB_BRANCH=develop ./easybuild_develop.sh 2> /dev/null 1>&2; EB_PREFIX=/home/boegelbot/easybuild/develop source init_env_easybuild_develop.sh; fi; EB_PR=3699 EB_ARGS="TensorFlow-2.15.1-foss-2023a.eb TensorFlow-2.13.0-foss-2023a.eb" EB_REPO=easybuild-easyblocks EB_BRANCH=develop /opt/software/slurm/bin/sbatch --job-name test_PR_3699 --ntasks=8 ~/boegelbot/eb_from_pr_upload_jsc-zen3.sh' executed!

  • exit code: 0
  • output:
Submitted batch job 6303

Test results coming soon (I hope)...

Details

- notification for comment with ID 2857925788 processed

Message to humans: this is just bookkeeping information for me,
it is of no use to you (unless you think I have a bug, which I don't).

@pavelToman
Copy link
Contributor Author

@boegelbot please test @ jsc-zen3-a100
EB_ARGS="TensorFlow-2.15.1-foss-2023a-CUDA-12.1.1.eb"

@boegelbot
Copy link

@pavelToman: Request for testing this PR well received on jsczen3l1.int.jsc-zen3.fz-juelich.de

PR test command 'if [[ develop != 'develop' ]]; then EB_BRANCH=develop ./easybuild_develop.sh 2> /dev/null 1>&2; EB_PREFIX=/home/boegelbot/easybuild/develop source init_env_easybuild_develop.sh; fi; EB_PR=3699 EB_ARGS="TensorFlow-2.15.1-foss-2023a-CUDA-12.1.1.eb" EB_REPO=easybuild-easyblocks EB_BRANCH=develop /opt/software/slurm/bin/sbatch --job-name test_PR_3699 --ntasks=8 ~/boegelbot/eb_from_pr_upload_jsc-zen3.sh' executed!

  • exit code: 0
  • output:
Submitted batch job 6304

Test results coming soon (I hope)...

Details

- notification for comment with ID 2857928452 processed

Message to humans: this is just bookkeeping information for me,
it is of no use to you (unless you think I have a bug, which I don't).

Copy link
Contributor

@Flamefire Flamefire left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

A bit polishing and the last comment looks possibly important.

The PR looks good to me and those changes aren't very important.

Co-authored-by: Alexander Grund <Flamefire@users.noreply.github.com>
@boegelbot
Copy link

Test report by @boegelbot

Overview of tested easyconfigs (in order)

  • SUCCESS TensorFlow-2.15.1-foss-2023a-CUDA-12.1.1.eb

Build succeeded for 1 out of 1 (1 easyconfigs in total)
jsczen3c1.int.jsc-zen3.fz-juelich.de - Linux Rocky Linux 9.5, x86_64, AMD EPYC-Milan Processor (zen3), Python 3.9.21
See https://gist.github.com/boegelbot/17f5cf32dc8f54992eb88c086344b975 for a full test report.

@boegelbot
Copy link

Test report by @boegelbot

Overview of tested easyconfigs (in order)

  • SUCCESS TensorFlow-2.15.1-foss-2023a.eb
  • SUCCESS TensorFlow-2.13.0-foss-2023a.eb

Build succeeded for 2 out of 2 (2 easyconfigs in total)
jsczen3c1.int.jsc-zen3.fz-juelich.de - Linux Rocky Linux 9.5, x86_64, AMD EPYC-Milan Processor (zen3), Python 3.9.21
See https://gist.github.com/boegelbot/5ab311b5e61d6ed6d0127d937113bad6 for a full test report.

@Flamefire
Copy link
Contributor

Flamefire commented May 16, 2025

2.18+ seems to require --config=cuda_wheel independent whether CUDA being used. Otherwise it fails with

Please provide --config=cuda_wheel for bazel build command. If you absolutely need to add CUDA dependencies, provide --@local_config_cuda//cuda:override_include_cuda_libs=true.

This flag and error is new in 2.18

I.e. add self.target_opts.append('--config=cuda_wheel') somewhere reasonably conditioned on 2.18+

@pavelToman
Copy link
Contributor Author

@boegelbot please test @ jsc-zen3-a100
EB_ARGS="TensorFlow-2.15.1-foss-2023a-CUDA-12.1.1.eb"

@boegelbot
Copy link

@pavelToman: Request for testing this PR well received on jsczen3l1.int.jsc-zen3.fz-juelich.de

PR test command 'if [[ develop != 'develop' ]]; then EB_BRANCH=develop ./easybuild_develop.sh 2> /dev/null 1>&2; EB_PREFIX=/home/boegelbot/easybuild/develop source init_env_easybuild_develop.sh; fi; EB_PR=3699 EB_ARGS="TensorFlow-2.15.1-foss-2023a-CUDA-12.1.1.eb" EB_REPO=easybuild-easyblocks EB_BRANCH=develop /opt/software/slurm/bin/sbatch --job-name test_PR_3699 --ntasks=8 ~/boegelbot/eb_from_pr_upload_jsc-zen3.sh' executed!

  • exit code: 0
  • output:
Submitted batch job 6363

Test results coming soon (I hope)...

Details

- notification for comment with ID 2886617350 processed

Message to humans: this is just bookkeeping information for me,
it is of no use to you (unless you think I have a bug, which I don't).

@boegelbot
Copy link

Test report by @boegelbot

Overview of tested easyconfigs (in order)

  • SUCCESS TensorFlow-2.15.1-foss-2023a-CUDA-12.1.1.eb

Build succeeded for 1 out of 1 (1 easyconfigs in total)
jsczen3c1.int.jsc-zen3.fz-juelich.de - Linux Rocky Linux 9.5, x86_64, AMD EPYC-Milan Processor (zen3), Python 3.9.21
See https://gist.github.com/boegelbot/fc6efcda67f8e6e887de18c83fb78c5e for a full test report.

@pavelToman
Copy link
Contributor Author

@boegelbot please test @ jsc-zen3-a100
EB_ARGS="TensorFlow-2.15.1-foss-2023a-CUDA-12.1.1.eb"

@boegelbot
Copy link

@pavelToman: Request for testing this PR well received on jsczen3l1.int.jsc-zen3.fz-juelich.de

PR test command 'if [[ develop != 'develop' ]]; then EB_BRANCH=develop ./easybuild_develop.sh 2> /dev/null 1>&2; EB_PREFIX=/home/boegelbot/easybuild/develop source init_env_easybuild_develop.sh; fi; EB_PR=3699 EB_ARGS="TensorFlow-2.15.1-foss-2023a-CUDA-12.1.1.eb" EB_REPO=easybuild-easyblocks EB_BRANCH=develop /opt/software/slurm/bin/sbatch --job-name test_PR_3699 --ntasks=8 ~/boegelbot/eb_from_pr_upload_jsc-zen3.sh' executed!

  • exit code: 0
  • output:
Submitted batch job 6385

Test results coming soon (I hope)...

Details

- notification for comment with ID 2890681720 processed

Message to humans: this is just bookkeeping information for me,
it is of no use to you (unless you think I have a bug, which I don't).

@boegelbot
Copy link

Test report by @boegelbot

Overview of tested easyconfigs (in order)

  • SUCCESS TensorFlow-2.15.1-foss-2023a-CUDA-12.1.1.eb

Build succeeded for 1 out of 1 (1 easyconfigs in total)
jsczen3c3.int.jsc-zen3.fz-juelich.de - Linux Rocky Linux 9.5, x86_64, AMD EPYC-Milan Processor (zen3), Python 3.9.21
See https://gist.github.com/boegelbot/e92c490a062f51e60856902056420be1 for a full test report.

@boegel boegel self-assigned this May 20, 2025
Copy link
Member

@boegel boegel left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm

@boegel boegel changed the title Update TensorFlow easyblock for v2.18.1 Update TensorFlow easyblock for v2.18.x May 20, 2025
@boegel
Copy link
Member

boegel commented May 20, 2025

Test report by @boegel

Overview of tested easyconfigs (in order)

  • SUCCESS Bazel-6.1.0-GCCcore-12.3.0.eb
  • SUCCESS TensorFlow-2.15.1-foss-2023a-CUDA-12.1.1.eb

Build succeeded for 2 out of 2 (1 easyconfigs in total)
node3905.accelgor.os - Linux RHEL 9.4, x86_64, AMD EPYC 7413 24-Core Processor (zen3), 1 x NVIDIA NVIDIA A100-SXM4-80GB, 570.133.20, Python 3.9.18
See https://gist.github.com/boegel/b535d1f67711010bbcec8be180a74e13 for a full test report.

@boegel
Copy link
Member

boegel commented May 21, 2025

Test report by @boegel

Overview of tested easyconfigs (in order)

  • SUCCESS TensorFlow-2.15.1-foss-2023a-CUDA-12.1.1.eb

Build succeeded for 1 out of 1 (1 easyconfigs in total)
node4013.donphan.os - Linux RHEL 9.4, x86_64, Intel(R) Xeon(R) Gold 6240 CPU @ 2.60GHz (cascadelake), 1 x NVIDIA NVIDIA A2, 570.86.15, Python 3.9.18
See https://gist.github.com/boegel/6fd3e6aee60833d1b98ec59b63a4dd5a for a full test report.

@boegel boegel merged commit 556409f into easybuilders:develop May 21, 2025
17 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

CheckM2

4 participants