{geo}[foss/2023a] TorchGeo v0.6.0 w/ CUDA 12.1.1 #21308

Closed
branfosj wants to merge 7 commits into easybuilders:develop from branfosj:20240904084015_new_pr_TorchGeo060

@branfosj
Member

branfosj commented Sep 4, 2024

(created using eb --new-pr)

We cannot use the rasterio easyconfig here: it has a boto3 dependency that provides different versions of several boto packages than those in the PyTorch-Lightning easyconfig.

branfosj added the new label Sep 4, 2024
branfosj added this to the 4.x milestone Sep 4, 2024
@smoors
Contributor

smoors commented Sep 13, 2024

@boegelbot please test @ jsc-zen3-a100

@boegelbot
Collaborator

@smoors: Request for testing this PR well received on jsczen3l1.int.jsc-zen3.fz-juelich.de

PR test command 'if [[ develop != 'develop' ]]; then EB_BRANCH=develop ./easybuild_develop.sh 2> /dev/null 1>&2; EB_PREFIX=/home/boegelbot/easybuild/develop source init_env_easybuild_develop.sh; fi; EB_PR=21308 EB_ARGS= EB_CONTAINER= EB_REPO=easybuild-easyconfigs EB_BRANCH=develop /opt/software/slurm/bin/sbatch --job-name test_PR_21308 --ntasks=8 --partition=jsczen3g --gres=gpu:1 ~/boegelbot/eb_from_pr_upload_jsc-zen3.sh' executed!

  • exit code: 0
  • output:
Submitted batch job 4874

Test results coming soon (I hope)...

Details

- notification for comment with ID 2350071909 processed

Message to humans: this is just bookkeeping information for me,
it is of no use to you (unless you think I have a bug, which I don't).

@boegelbot
Collaborator

Test report by @boegelbot
FAILED
Build succeeded for 1 out of 3 (3 easyconfigs in total)
jsczen3g1.int.jsc-zen3.fz-juelich.de - Linux Rocky Linux 9.4, x86_64, AMD EPYC-Milan Processor (zen3), 1 x NVIDIA NVIDIA A100 80GB PCIe, 555.42.06, Python 3.9.18
See https://gist.github.com/boegelbot/e664f115656d759b5063676a14f2cf0a for a full test report.

@smoors
Contributor

smoors commented Sep 13, 2024

Test report by @smoors
FAILED
Build succeeded for 4 out of 6 (3 easyconfigs in total)
node400.hydra.os - Linux Rocky Linux 8.10, x86_64, AMD EPYC 7282 16-Core Processor (zen2), 1 x NVIDIA NVIDIA A100-PCIE-40GB, 550.90.07, Python 3.6.8
See https://gist.github.com/smoors/aae3a4f97844f470df86fdca234f0f18 for a full test report.

Add CMake builddep for kornia
github-actions bot added the update label and removed the new label Mar 4, 2025
@github-actions

github-actions bot commented Mar 4, 2025

Updated software TorchGeo-0.6.0-foss-2023a-CUDA-12.1.1.eb

Diff against TorchGeo-0.6.0-foss-2023a.eb

easybuild/easyconfigs/t/TorchGeo/TorchGeo-0.6.0-foss-2023a.eb

diff --git a/easybuild/easyconfigs/t/TorchGeo/TorchGeo-0.6.0-foss-2023a.eb b/easybuild/easyconfigs/t/TorchGeo/TorchGeo-0.6.0-foss-2023a-CUDA-12.1.1.eb
index f26fd966eb..d17b45d9c1 100644
--- a/easybuild/easyconfigs/t/TorchGeo/TorchGeo-0.6.0-foss-2023a.eb
+++ b/easybuild/easyconfigs/t/TorchGeo/TorchGeo-0.6.0-foss-2023a-CUDA-12.1.1.eb
@@ -3,6 +3,7 @@ easyblock = 'PythonBundle'
 
 name = 'TorchGeo'
 version = '0.6.0'
+versionsuffix = '-CUDA-%(cudaver)s'
 
 homepage = "https://torchgeo.readthedocs.io/"
 description = """TorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data."""
@@ -17,22 +18,22 @@ builddependencies = [
 ]
 
 dependencies = [
+    ('CUDA', '12.1.1', '', SYSTEM),
     ('Python', '3.11.3'),
     ('SciPy-bundle', '2023.07'),
-    ('GDAL', '3.7.1'),
     ('matplotlib', '3.7.2'),
-    ('PyTorch', '2.1.2'),
-    ('PyTorch-Lightning', '2.2.1'),
-    ('Lightning', '2.2.1'),
+    ('PyTorch', '2.1.2', versionsuffix),
+    ('PyTorch-Lightning', '2.2.1', versionsuffix),
+    ('Lightning', '2.2.1', versionsuffix),
     ('einops', '0.7.0'),
     ('Fiona', '1.9.5'),
     ('pyproj', '3.6.0'),
     ('Rtree', '1.2.0'),
     ('Shapely', '2.0.1'),
-    ('torchvision', '0.16.0'),
+    ('torchvision', '0.16.0', versionsuffix),
     # OPTIONAL
     ('h5py', '3.9.0'),
-    ('OpenCV', '4.8.1', '-contrib'),
+    ('OpenCV', '4.8.1', '%s-contrib' % versionsuffix),
     ('scikit-image', '0.22.0'),
 ]
 

Updated software TorchGeo-0.6.0-foss-2023a.eb

Diff against TorchGeo-0.6.0-foss-2023a-CUDA-12.1.1.eb

easybuild/easyconfigs/t/TorchGeo/TorchGeo-0.6.0-foss-2023a-CUDA-12.1.1.eb

diff --git a/easybuild/easyconfigs/t/TorchGeo/TorchGeo-0.6.0-foss-2023a-CUDA-12.1.1.eb b/easybuild/easyconfigs/t/TorchGeo/TorchGeo-0.6.0-foss-2023a.eb
index d17b45d9c1..f26fd966eb 100644
--- a/easybuild/easyconfigs/t/TorchGeo/TorchGeo-0.6.0-foss-2023a-CUDA-12.1.1.eb
+++ b/easybuild/easyconfigs/t/TorchGeo/TorchGeo-0.6.0-foss-2023a.eb
@@ -3,7 +3,6 @@ easyblock = 'PythonBundle'
 
 name = 'TorchGeo'
 version = '0.6.0'
-versionsuffix = '-CUDA-%(cudaver)s'
 
 homepage = "https://torchgeo.readthedocs.io/"
 description = """TorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data."""
@@ -18,22 +17,22 @@ builddependencies = [
 ]
 
 dependencies = [
-    ('CUDA', '12.1.1', '', SYSTEM),
     ('Python', '3.11.3'),
     ('SciPy-bundle', '2023.07'),
+    ('GDAL', '3.7.1'),
     ('matplotlib', '3.7.2'),
-    ('PyTorch', '2.1.2', versionsuffix),
-    ('PyTorch-Lightning', '2.2.1', versionsuffix),
-    ('Lightning', '2.2.1', versionsuffix),
+    ('PyTorch', '2.1.2'),
+    ('PyTorch-Lightning', '2.2.1'),
+    ('Lightning', '2.2.1'),
     ('einops', '0.7.0'),
     ('Fiona', '1.9.5'),
     ('pyproj', '3.6.0'),
     ('Rtree', '1.2.0'),
     ('Shapely', '2.0.1'),
-    ('torchvision', '0.16.0', versionsuffix),
+    ('torchvision', '0.16.0'),
     # OPTIONAL
     ('h5py', '3.9.0'),
-    ('OpenCV', '4.8.1', '%s-contrib' % versionsuffix),
+    ('OpenCV', '4.8.1', '-contrib'),
     ('scikit-image', '0.22.0'),
 ]
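
The `versionsuffix` lines in the diffs above rely on two layers of string formatting: ordinary Python `%` formatting when the easyconfig is parsed, and EasyBuild's `%(cudaver)s` template, which is filled in from the CUDA dependency. The following is a minimal plain-Python sketch of how those strings would end up, assuming `cudaver` resolves to `12.1.1`. The `template_values` dictionary is a hypothetical stand-in for EasyBuild's own template resolution, not its actual API.

```python
# Plain-Python illustration only; EasyBuild performs the template resolution itself.

# As written in the CUDA easyconfig: the template is still unresolved at parse time.
raw_versionsuffix = '-CUDA-%(cudaver)s'

# Ordinary Python %-formatting, as in ('OpenCV', '4.8.1', '%s-contrib' % versionsuffix).
# The template placeholder is carried along untouched.
raw_opencv_suffix = '%s-contrib' % raw_versionsuffix

# Hypothetical stand-in for EasyBuild's template values (assumption: cudaver
# comes from the ('CUDA', '12.1.1', '', SYSTEM) dependency).
template_values = {'cudaver': '12.1.1'}

versionsuffix = raw_versionsuffix % template_values
opencv_suffix = raw_opencv_suffix % template_values

print(versionsuffix)
print(opencv_suffix)
```

Under these assumptions the dependency suffixes line up with the file name `TorchGeo-0.6.0-foss-2023a-CUDA-12.1.1.eb`, and the OpenCV dependency keeps its `-contrib` suffix after the CUDA part.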
 

@branfosj
Member Author

I won't be coming back to this to fix TorchGeo and get the pipeline passing, so I'm closing the PR.

branfosj closed this Jun 17, 2025
branfosj deleted the 20240904084015_new_pr_TorchGeo060 branch June 17, 2025 09:19