Skip to content

Releases: stackhpc/ansible-slurm-appliance

v1.129

18 Apr 11:23
0e6ef7e
Compare
Choose a tag to compare

What's Changed

  • Return pingpong sbatch output if job fails by @sjpb in #242
  • Support configuring nameservers and proxies by @sjpb in #247

Full Changelog: v1.128...v1.129

Deployment notes

  • No galaxy reinstalls required

Image details

v1.128

14 Apr 11:05
dcf2d1d
Compare
Choose a tag to compare

What's Changed

  • Environment hooks used when building "fat" image by @sjpb in #255 - allows building site-specific fat images.
  • Re-enable NetworkManager control of /etc/resolv.conf after image build by @sjpb in #258 - fixes #257.

Full Changelog: v1.127...v1.128

Deployment Notes

  • No galaxy reinstalls required

Image Info

NB: This build uses a RockyLinux 8.6 image plus updates instead of the Rocky-8-GenericCloud-Base-8.7-20221130.0.x86_64.qcow2 image, to avoid an issue with volume mounts during reboot in that image.

v1.127

14 Mar 13:11
416b440
Compare
Choose a tag to compare

What's Changed

  • Build fat image in appliance by @sjpb in #250:
    • The "fat" image containing binaries for all nodes is now built by this repo, not https://github.com/stackhpc/slurm_image_builder as previously. See section below for latest image details.
    • This removes environment-specific control and login node builds - use the fat image instead (these nodes already required the ansible site.yml playbook running after a reimage).
    • Compute node build is now intended to be performed from the latest fat image, hence removed the yum update *.
    • Various minor fixes, see above PR.

Full Changelog: v1.126...v1.127

Image Info

NB: This build uses a RockyLinux 8.6 image plus updates instead of the Rocky-8-GenericCloud-Base-8.7-20221130.0.x86_64.qcow2 image, to avoid an issue with volume mounts during reboot in that image.

v1.126

22 Feb 16:35
c130ed9
Compare
Choose a tag to compare

What's Changed

  • Provide container packages by @sjpb in #249: Clusters now have apptainer (for singularity), podman and podman-compose packages by default
  • Add script to retrieve CI inventory by @sjpb in #248

NB: This is now based on OpenHPC v2.6.1, removing the workarounds added in v1.123.

Full Changelog: v1.125...v1.126

Deployment Notes

Galaxy roles/collection versions have been changed so use ansible-galaxy {role,collection} install -f ... after merging to force-update these.

v1.125

26 Jan 13:52
bdeda03
Compare
Choose a tag to compare

What's Changed

Replaces OpenDistro with OpenSearch 2.4.0 by @sjpb in #197, including:

  • Grafana updated from 8.5.9 to 9.0.3.
  • Filebeat-OSS updated from 7.9.3 to 7.12.1.
  • Host group and role opendistro replaced with group and role opensearch.
  • elasticsearch_* and opendistro_* variables used for opendistro role replaced by opensearch_* variables.
  • Changed to host networking for containers.
  • Updated CaaS/CI image to openhpc-230110-1629.qcow2.

Full Changelog: v1.124...v1.125

Notes

  1. By default opendistro uses self-signed certs with 10 years validity. Certs are automatically updated if necessary when running the appliance.

Deployment

  1. Merge this release:

    • Change any environment-specific elasticsearch_* and opendistro_* variables to appropriate opensearch_* equivalents. Note the defaults in the common environment are changed by this release so only non-defaults require manual action.
    • Remove podman_cidr variable as this is unused for container host networking.
  2. Update galaxy roles and collections.

  3. Run site.yml. OpenDistro data in {{ appliances_state_dir | default('/usr/share') }}/elasticsearch/data will be automatically migrated to OpenSearch . Note the OpenDistro data will not automatically be deleted.

v1.124

12 Jan 15:34
0aaaf67
Compare
Choose a tag to compare

What's Changed

  • Remove redundant templated ansible_user definition by @sjpb in #244

Full Changelog: v1.123...v1.124

v1.123

11 Jan 09:13
6d11e4c
Compare
Choose a tag to compare

What's Changed

  • Fix RL8.6 and EPEL repo changes (by @sjpb in #245):
    • Remove releasever pinning (pinning to 8.6 no longer works as packages have been moved to Vault)
    • Bumps the "fat" image for CI appropriately (stackhpc/slurm_image_builder#13)
    • Adds an older apptainer package from github release, to workaround openhpc/ohpc#1644 (EPEL package changes break dependencies for ohpc-base-compute package).

Full Changelog: v1.122...v1.123

v1.122

11 Jan 09:12
230ff80
Compare
Choose a tag to compare

What's Changed

  • Automatically generate openondemand_host_regex (by @sjpb in #243)

Full Changelog: v1.121...v1.122