Skip to content

{2023.06}[2023a, sapphire_rapids] ESPResSO 4.2.1, Rivet 3.1.9#875

Merged
boegel merged 2 commits intoEESSI:2023.06-software.eessi.iofrom
bedroge:sapphire_rapids_eb490_2023a_espresso_rivet_pytorch
Jan 23, 2025
Merged

{2023.06}[2023a, sapphire_rapids] ESPResSO 4.2.1, Rivet 3.1.9#875
boegel merged 2 commits intoEESSI:2023.06-software.eessi.iofrom
bedroge:sapphire_rapids_eb490_2023a_espresso_rivet_pytorch

Conversation

@bedroge
Copy link
Copy Markdown
Collaborator

@bedroge bedroge commented Jan 22, 2025

No description provided.

@bedroge bedroge added 2023.06-software.eessi.io 2023.06 version of software.eessi.io sapphirerapids labels Jan 22, 2025
@eessi-bot
Copy link
Copy Markdown

eessi-bot bot commented Jan 22, 2025

Instance eessi-bot-mc-aws is configured to build for:

  • architectures: x86_64/generic, x86_64/intel/haswell, x86_64/intel/sapphire_rapids, x86_64/intel/skylake_avx512, x86_64/amd/zen2, x86_64/amd/zen3, aarch64/generic, aarch64/neoverse_n1, aarch64/neoverse_v1
  • repositories: eessi.io-2023.06-software, eessi.io-2023.06-compat

@eessi-bot
Copy link
Copy Markdown

eessi-bot bot commented Jan 22, 2025

Instance eessi-bot-mc-azure is configured to build for:

  • architectures: x86_64/amd/zen4
  • repositories: eessi.io-2023.06-software, eessi.io-2023.06-compat

@gpu-bot-ugent
Copy link
Copy Markdown

gpu-bot-ugent bot commented Jan 22, 2025

Instance eessi-bot-vsc-ugent is configured to build for:

  • architectures: x86_64/amd/zen3
  • repositories: eessi.io-2023.06-software, eessi-hpc.org-2023.06-software, eessi.io-2023.06-compat, eessi-hpc.org-2023.06-compat

@bedroge
Copy link
Copy Markdown
Collaborator Author

bedroge commented Jan 22, 2025

bot: build repo:eessi.io-2023.06-software arch:x86_64/intel/sapphire_rapids

@eessi-bot
Copy link
Copy Markdown

eessi-bot bot commented Jan 22, 2025

Updates by the bot instance eessi-bot-mc-aws (click for details)
  • received bot command build repo:eessi.io-2023.06-software arch:x86_64/intel/sapphire_rapids from bedroge

    • expanded format: build repository:eessi.io-2023.06-software architecture:x86_64/intel/sapphire_rapids
  • handling command build repository:eessi.io-2023.06-software architecture:x86_64/intel/sapphire_rapids resulted in:

@gpu-bot-ugent
Copy link
Copy Markdown

gpu-bot-ugent bot commented Jan 22, 2025

Updates by the bot instance eessi-bot-vsc-ugent (click for details)
  • received bot command build repo:eessi.io-2023.06-software arch:x86_64/intel/sapphire_rapids from bedroge

    • expanded format: build repository:eessi.io-2023.06-software architecture:x86_64/intel/sapphire_rapids
  • handling command build repository:eessi.io-2023.06-software architecture:x86_64/intel/sapphire_rapids resulted in:

    • no jobs were submitted

@eessi-bot
Copy link
Copy Markdown

eessi-bot bot commented Jan 22, 2025

Updates by the bot instance eessi-bot-mc-azure (click for details)
  • received bot command build repo:eessi.io-2023.06-software arch:x86_64/intel/sapphire_rapids from bedroge

    • expanded format: build repository:eessi.io-2023.06-software architecture:x86_64/intel/sapphire_rapids
  • handling command build repository:eessi.io-2023.06-software architecture:x86_64/intel/sapphire_rapids resulted in:

    • no jobs were submitted

@eessi-bot
Copy link
Copy Markdown

eessi-bot bot commented Jan 22, 2025

New job on instance eessi-bot-mc-aws for CPU micro-architecture x86_64-intel-sapphire_rapids for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2025.01/pr_875/41993

date job status comment
Jan 22 10:28:03 UTC 2025 submitted job id 41993 awaits release by job manager
Jan 22 10:28:58 UTC 2025 released job awaits launch by Slurm scheduler
Jan 22 10:35:03 UTC 2025 running job 41993 is running
Jan 22 22:30:30 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-41993.out
✅ no message matching FATAL:
❌ found message matching ERROR:
❌ found message matching FAILED:
❌ found message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-intel-sapphire_rapids-1737584127.tar.gzsize: 193 MiB (202592554 bytes)
entries: 35408
modules under 2023.06/software/linux/x86_64/intel/sapphire_rapids/modules/all
BeautifulSoup/4.12.2-GCCcore-12.3.0.lua
Boost.MPI/1.82.0-gompi-2023a.lua
ESPResSo/4.2.1-foss-2023a.lua
expecttest/0.1.5-GCCcore-12.3.0.lua
fastjet/3.4.2-gompi-2023a.lua
fastjet-contrib/1.053-gompi-2023a.lua
GMP/6.2.1-GCCcore-12.3.0.lua
gmpy2/2.1.5-GCC-12.3.0.lua
GSL/2.7-GCC-12.3.0.lua
HepMC3/3.2.6-GCC-12.3.0.lua
IPython/8.14.0-GCCcore-12.3.0.lua
libsodium/1.0.18-GCCcore-12.3.0.lua
libxslt/1.1.38-GCCcore-12.3.0.lua
libyaml/0.2.5-GCCcore-12.3.0.lua
lxml/4.9.2-GCCcore-12.3.0.lua
MPC/1.3.1-GCCcore-12.3.0.lua
MPFR/4.2.0-GCCcore-12.3.0.lua
networkx/3.1-gfbf-2023a.lua
OpenPGM/5.2.122-GCCcore-12.3.0.lua
Pillow/10.0.0-GCCcore-12.3.0.lua
Pint/0.23-GCCcore-12.3.0.lua
pytest-flakefinder/1.1.0-GCCcore-12.3.0.lua
pytest-rerunfailures/12.0-GCCcore-12.3.0.lua
pytest-shard/0.1.2-GCCcore-12.3.0.lua
PyYAML/6.0-GCCcore-12.3.0.lua
Rivet/3.1.9-gompi-2023a-HepMC3-3.2.6.lua
siscone/3.0.6-GCCcore-12.3.0.lua
sympy/1.12-gfbf-2023a.lua
typing-extensions/4.9.0-GCCcore-12.3.0.lua
YODA/1.9.9-GCC-12.3.0.lua
Z3/4.12.2-GCCcore-12.3.0.lua
ZeroMQ/4.3.4-GCCcore-12.3.0.lua
software under 2023.06/software/linux/x86_64/intel/sapphire_rapids/software
BeautifulSoup/4.12.2-GCCcore-12.3.0
Boost.MPI/1.82.0-gompi-2023a
ESPResSo/4.2.1-foss-2023a
expecttest/0.1.5-GCCcore-12.3.0
fastjet/3.4.2-gompi-2023a
fastjet-contrib/1.053-gompi-2023a
GMP/6.2.1-GCCcore-12.3.0
gmpy2/2.1.5-GCC-12.3.0
GSL/2.7-GCC-12.3.0
HepMC3/3.2.6-GCC-12.3.0
IPython/8.14.0-GCCcore-12.3.0
libsodium/1.0.18-GCCcore-12.3.0
libxslt/1.1.38-GCCcore-12.3.0
libyaml/0.2.5-GCCcore-12.3.0
lxml/4.9.2-GCCcore-12.3.0
MPC/1.3.1-GCCcore-12.3.0
MPFR/4.2.0-GCCcore-12.3.0
networkx/3.1-gfbf-2023a
OpenPGM/5.2.122-GCCcore-12.3.0
Pillow/10.0.0-GCCcore-12.3.0
Pint/0.23-GCCcore-12.3.0
pytest-flakefinder/1.1.0-GCCcore-12.3.0
pytest-rerunfailures/12.0-GCCcore-12.3.0
pytest-shard/0.1.2-GCCcore-12.3.0
PyYAML/6.0-GCCcore-12.3.0
Rivet/3.1.9-gompi-2023a-HepMC3-3.2.6
siscone/3.0.6-GCCcore-12.3.0
sympy/1.12-gfbf-2023a
typing-extensions/4.9.0-GCCcore-12.3.0
YODA/1.9.9-GCC-12.3.0
Z3/4.12.2-GCCcore-12.3.0
ZeroMQ/4.3.4-GCCcore-12.3.0
other under 2023.06/software/linux/x86_64/intel/sapphire_rapids
no other files in tarball
Jan 22 22:30:30 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite produced failures.
ReFrame Summary
[ FAILED ] Ran 10/10 test case(s) from 10 check(s) (2 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-41993.out
❌ found message matching ERROR:
❌ found message matching [\s*FAILED\s*].*Ran .* test case

edit: this failed due to too many test failures:

== 2025-01-22 22:07:56,498 pytorch.py:465 WARNING 4 test failures, 0 test errors (out of 209567):
dynamo/test_functions 1/1 (1 failed, 165 passed, 2 skipped, 2 rerun)
dynamo/test_dynamic_shapes 1/1 (2 failed, 2029 passed, 52 skipped, 31 xfailed, 4 rerun)
test_proxy_tensor 1/1 (1 failed, 2074 passed, 617 skipped, 80 xfailed, 2 rerun)

@bedroge bedroge changed the title {2023.06}[2023a, sapphire_rapids] ESPResSO 4.2.1, Rivet 3.1.9, PyTorch 2.1.2 {2023.06}[2023a, sapphire_rapids] ESPResSO 4.2.1, Rivet 3.1.9 Jan 22, 2025
@bedroge
Copy link
Copy Markdown
Collaborator Author

bedroge commented Jan 22, 2025

Haven't checked the logs yet, but it looks like PyTorch failed, so let's exclude that one for now.

bot: build repo:eessi.io-2023.06-software arch:x86_64/intel/sapphire_rapids

@eessi-bot
Copy link
Copy Markdown

eessi-bot bot commented Jan 22, 2025

Updates by the bot instance eessi-bot-mc-aws (click for details)
  • received bot command build repo:eessi.io-2023.06-software arch:x86_64/intel/sapphire_rapids from bedroge

    • expanded format: build repository:eessi.io-2023.06-software architecture:x86_64/intel/sapphire_rapids
  • handling command build repository:eessi.io-2023.06-software architecture:x86_64/intel/sapphire_rapids resulted in:

@eessi-bot
Copy link
Copy Markdown

eessi-bot bot commented Jan 22, 2025

Updates by the bot instance eessi-bot-mc-azure (click for details)
  • received bot command build repo:eessi.io-2023.06-software arch:x86_64/intel/sapphire_rapids from bedroge

    • expanded format: build repository:eessi.io-2023.06-software architecture:x86_64/intel/sapphire_rapids
  • handling command build repository:eessi.io-2023.06-software architecture:x86_64/intel/sapphire_rapids resulted in:

    • no jobs were submitted

@eessi-bot
Copy link
Copy Markdown

eessi-bot bot commented Jan 22, 2025

New job on instance eessi-bot-mc-aws for CPU micro-architecture x86_64-intel-sapphire_rapids for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2025.01/pr_875/42005

date job status comment
Jan 22 22:49:41 UTC 2025 submitted job id 42005 awaits release by job manager
Jan 22 22:50:00 UTC 2025 released job awaits launch by Slurm scheduler
Jan 22 22:56:15 UTC 2025 running job 42005 is running
Jan 23 00:08:38 UTC 2025 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-42005.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-intel-sapphire_rapids-1737590371.tar.gzsize: 147 MiB (154755131 bytes)
entries: 30101
modules under 2023.06/software/linux/x86_64/intel/sapphire_rapids/modules/all
BeautifulSoup/4.12.2-GCCcore-12.3.0.lua
Boost.MPI/1.82.0-gompi-2023a.lua
ESPResSo/4.2.1-foss-2023a.lua
fastjet/3.4.2-gompi-2023a.lua
fastjet-contrib/1.053-gompi-2023a.lua
GSL/2.7-GCC-12.3.0.lua
HepMC3/3.2.6-GCC-12.3.0.lua
IPython/8.14.0-GCCcore-12.3.0.lua
libsodium/1.0.18-GCCcore-12.3.0.lua
libxslt/1.1.38-GCCcore-12.3.0.lua
lxml/4.9.2-GCCcore-12.3.0.lua
OpenPGM/5.2.122-GCCcore-12.3.0.lua
Pint/0.23-GCCcore-12.3.0.lua
Rivet/3.1.9-gompi-2023a-HepMC3-3.2.6.lua
siscone/3.0.6-GCCcore-12.3.0.lua
typing-extensions/4.9.0-GCCcore-12.3.0.lua
YODA/1.9.9-GCC-12.3.0.lua
ZeroMQ/4.3.4-GCCcore-12.3.0.lua
software under 2023.06/software/linux/x86_64/intel/sapphire_rapids/software
BeautifulSoup/4.12.2-GCCcore-12.3.0
Boost.MPI/1.82.0-gompi-2023a
ESPResSo/4.2.1-foss-2023a
fastjet/3.4.2-gompi-2023a
fastjet-contrib/1.053-gompi-2023a
GSL/2.7-GCC-12.3.0
HepMC3/3.2.6-GCC-12.3.0
IPython/8.14.0-GCCcore-12.3.0
libsodium/1.0.18-GCCcore-12.3.0
libxslt/1.1.38-GCCcore-12.3.0
lxml/4.9.2-GCCcore-12.3.0
OpenPGM/5.2.122-GCCcore-12.3.0
Pint/0.23-GCCcore-12.3.0
Rivet/3.1.9-gompi-2023a-HepMC3-3.2.6
siscone/3.0.6-GCCcore-12.3.0
typing-extensions/4.9.0-GCCcore-12.3.0
YODA/1.9.9-GCC-12.3.0
ZeroMQ/4.3.4-GCCcore-12.3.0
other under 2023.06/software/linux/x86_64/intel/sapphire_rapids
no other files in tarball
Jan 23 00:08:38 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite produced failures.
ReFrame Summary
[ FAILED ] Ran 10/10 test case(s) from 10 check(s) (2 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-42005.out
❌ found message matching ERROR:
❌ found message matching [\s*FAILED\s*].*Ran .* test case
Jan 23 07:23:27 UTC 2025 uploaded transfer of eessi-2023.06-software-linux-x86_64-intel-sapphire_rapids-1737590371.tar.gz to S3 bucket succeeded

@boegel boegel added the bot:deploy Ask bot to deploy missing software installations to EESSI label Jan 23, 2025
@boegel
Copy link
Copy Markdown
Contributor

boegel commented Jan 23, 2025

Tests for ESPResSo fail with:

FAILURE INFO for EESSI_ESPRESSO_LJ_PARTICLES %module_name=ESPResSo/4.2.1-foss-2023a %device_type=cpu %scale=1_node (run: 1/1)
  * Description:
  * System partition: BotBuildTests:x86-64-intel-srapids-node
  * Environment: default
  * Stage directory: /project/60006/SHARED/jobs/2025.01/pr_875/event_27554380-d913-11ef-9e7f-82ef17532ce4/run_000/linux_x86_64_intel_sapphire_rapids/eessi.io-2023.06-software/reframe_runs/stage/BotBuildTests/x86-64-intel-srapids-node/default/EESSI_ESPRESSO_LJ_PARTICLES_fdd6aced
  * Node list:
  * Job type: local (id=None)
  * Dependencies (conceptual): []
  * Dependencies (actual): []
  * Maintainers: []
  * Failing phase: setup
  * Rerun with '-n /fdd6aced -p default --system BotBuildTests:x86-64-intel-srapids-node -r'
  * Reason: attribute error: EESSI-test-suite/eessi/testsuite/hooks.py:720: 'EESSI_ESPRESSO_LJ_PARTICLES' object has no attribute 'always_request_gpus'
    always_request_gpus = FEATURES[ALWAYS_REQUEST_GPUS] in test.current_partition.features or test.always_request_gpus

That looks like a bug in the test itself, not a problem with the installation, so I won't let that block the deployment of this...

@boegel boegel merged commit a100070 into EESSI:2023.06-software.eessi.io Jan 23, 2025
@eessi-bot
Copy link
Copy Markdown

eessi-bot bot commented Jan 23, 2025

PR merged! Moved ['/project/def-users/SHARED/jobs/2025.01/pr_875/41993', '/project/def-users/SHARED/jobs/2025.01/pr_875/42005'] to /project/def-users/SHARED/trash_bin/EESSI/software-layer/2025.01.23

@eessi-bot
Copy link
Copy Markdown

eessi-bot bot commented Jan 23, 2025

PR merged! Moved [] to /project/def-users/SHARED/trash_bin/EESSI/software-layer/2025.01.23

@bedroge bedroge deleted the sapphire_rapids_eb490_2023a_espresso_rivet_pytorch branch January 23, 2025 08:29
@boegel boegel added the EuroHPC label Jun 11, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

2023.06-software.eessi.io 2023.06 version of software.eessi.io bot:deploy Ask bot to deploy missing software installations to EESSI EuroHPC sapphirerapids

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants