update TensorFlow easyblock for CUDA support in TensorFlow 2.18+ #3765
|
As for building TensorFlow 2.18+ with our CUDA: upstream no longer officially supports that and strongly suggests using the "hermetic" CUDA, i.e. letting Bazel download it during the build. They argue that the build already "downloads half the internet", so one more download doesn't hurt, and that they use checksums for verification too. Would that be acceptable for us, or shall we still pursue using our own CUDA? See easybuilders/easybuild-easyconfigs#22921 (comment) Edit: Solution implemented in #3791: symlink the CUPTI files in the CUDA module so that they are found |
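To illustrate the symlinking idea mentioned in the edit above, here is a minimal sketch. It assumes the usual CUDA toolkit layout, where CUPTI lives under `extras/CUPTI/include` and `extras/CUPTI/lib64`, and links those files into the top-level `include` and `lib64` directories. The function name `symlink_cupti` is hypothetical, and the actual change in #3791 is not shown in this conversation and may differ.

```python
import os


def symlink_cupti(cuda_root):
    """Link CUPTI files into the top-level include/ and lib64/ of a CUDA install.

    Assumption: CUPTI is located in <cuda_root>/extras/CUPTI/{include,lib64}.
    Returns the list of symlinks that were created.
    """
    created = []
    for subdir in ("include", "lib64"):
        src_dir = os.path.join(cuda_root, "extras", "CUPTI", subdir)
        dst_dir = os.path.join(cuda_root, subdir)
        if not os.path.isdir(src_dir):
            continue  # nothing to link for this subdirectory
        os.makedirs(dst_dir, exist_ok=True)
        for name in sorted(os.listdir(src_dir)):
            dst = os.path.join(dst_dir, name)
            if os.path.lexists(dst):
                continue  # never overwrite an existing file or link
            src = os.path.join(src_dir, name)
            # Relative links keep the installation relocatable.
            os.symlink(os.path.relpath(src, dst_dir), dst)
            created.append(dst)
    return created
```

Calling `symlink_cupti` a second time on the same directory creates nothing new, because existing entries are skipped.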
|
Test report by @lexming Overview of tested easyconfigs (in order)
Build succeeded for 0 out of 1 (1 easyconfigs in total) |
|
@lexming Can you search the log for the error? With easybuilders/easybuild-framework#4942 the test report would likely contain it, so maybe we can get that in soon |
|
Test report by @lexming Overview of tested easyconfigs (in order)
Build succeeded for 0 out of 1 (1 easyconfigs in total) |
|
@Flamefire my tests on 2022a and 2023a failed due to linking issues with OpenSSL in the system: This error is not caused by this PR though. The problem is that these old toolchains use OpenSSL v1.1, while my system (Rocky 9) has OpenSSL v3 and just some compat libs with OpenSSL v1.1. This means that the headers for OpenSSL under So, rebuilding OpenSSL v1.1 from source and testing again... |
|
Test report by @lexming Overview of tested easyconfigs (in order)
Build succeeded for 1 out of 1 (1 easyconfigs in total) |
|
Test report by @lexming Overview of tested easyconfigs (in order)
Build succeeded for 1 out of 1 (1 easyconfigs in total) |
|
Test report by @lexming Overview of tested easyconfigs (in order)
Build succeeded for 1 out of 1 (1 easyconfigs in total) |
|
Merging, thanks @Flamefire ! |
(created using eb --new-pr)
A few variables were renamed or removed. The most important are the LOCAL_* variables used to find the installed CUDA.
I also upgraded the warning about unknown variables "passed" to configure to a more visible one, as it was easy to miss.
Requires rebuild of CUDA with
Test report: easybuilders/easybuild-easyconfigs#22921 (comment)
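
As a rough illustration of the LOCAL_* variables mentioned in the description, the sketch below derives them from EasyBuild's `EBROOT*` environment variables and formats them as Bazel `--repo_env` options. The names `LOCAL_CUDA_PATH`, `LOCAL_CUDNN_PATH` and `LOCAL_NCCL_PATH` are assumed from TensorFlow's hermetic CUDA documentation. The mapping and both helper functions are hypothetical and are not taken from the easyblock in this PR.

```python
import os

# Hypothetical mapping from EasyBuild root variables to the LOCAL_* variables
# assumed to be read by TensorFlow's hermetic CUDA rules.
EBROOT_TO_LOCAL = {
    "EBROOTCUDA": "LOCAL_CUDA_PATH",
    "EBROOTCUDNN": "LOCAL_CUDNN_PATH",
    "EBROOTNCCL": "LOCAL_NCCL_PATH",
}


def local_cuda_env(environ=None):
    """Return LOCAL_* settings for every EBROOT* variable that is set."""
    environ = os.environ if environ is None else environ
    return {
        local: environ[root]
        for root, local in EBROOT_TO_LOCAL.items()
        if environ.get(root)
    }


def as_bazel_flags(env):
    """Format the settings as --repo_env options for the Bazel command line."""
    return [f"--repo_env={name}={value}" for name, value in sorted(env.items())]
```

Dependencies whose `EBROOT*` variable is unset are simply left out, so the result only names locally installed components.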