Releases: stackhpc/ansible-slurm-appliance
v1.129
What's Changed
- Return pingpong sbatch output if job fails by @sjpb in #242
- Support configuring nameservers and proxies by @sjpb in #247
Full Changelog: v1.128...v1.129
Deployment notes
- No galaxy reinstalls required
Image details
v1.128
What's Changed
- Environment hooks used when building "fat" image by @sjpb in #255 - allows building site-specific fat images.
- Re-enable NetworkManager control of /etc/resolv.conf after image build by @sjpb in #258 - fixes #257.
Full Changelog: v1.127...v1.128
Deployment Notes
- No galaxy reinstalls required
Image Info
- Provides new image openhpc-230412-1447-e3769af6.qcow2
- Requires a 40GB root disk.
NB: This build uses a RockyLinux 8.6 image plus updates instead of the Rocky-8-GenericCloud-Base-8.7-20221130.0.x86_64.qcow2 image, to avoid an issue with volume mounts during reboot in that image.
v1.127
What's Changed
- Build fat image in appliance by @sjpb in #250:
- The "fat" image containing binaries for all nodes is now built by this repo, not https://github.com/stackhpc/slurm_image_builder as previously. See section below for latest image details.
- This removes environment-specific control and login node builds - use the fat image instead (these nodes already required the ansible site.yml playbook running after a reimage).
- Compute node build is now intended to be performed from the latest fat image, hence removed the yum update *.
- Various minor fixes, see above PR.
Full Changelog: v1.126...v1.127
Image Info
- Provides image openhpc-230221-1226-f5ba2db7.qcow2.
- Requires a 40GB root disk.
NB: This build uses a RockyLinux 8.6 image plus updates instead of the Rocky-8-GenericCloud-Base-8.7-20221130.0.x86_64.qcow2 image, to avoid an issue with volume mounts during reboot in that image.
v1.126
What's Changed
- Provide container packages by @sjpb in #249: Clusters now have apptainer (for singularity), podman and podman-compose packages by default
- Add script to retrieve CI inventory by @sjpb in #248
NB: This is now based on OpenHPC v2.6.1, removing the workarounds added in v1.123.
Full Changelog: v1.125...v1.126
Deployment Notes
Galaxy roles/collection versions have been changed so use ansible-galaxy {role,collection} install -f ... after merging to force-update these.
v1.125
What's Changed
Replaces OpenDistro with OpenSearch 2.4.0 by @sjpb in #197, including:
- Grafana updated from 8.5.9 to 9.0.3.
- Filebeat-OSS updated from 7.9.3 to 7.12.1.
- Host group and role opendistro replaced with group and role opensearch.
- elasticsearch_* and opendistro_* variables used for opendistro role replaced by opensearch_* variables.
- Changed to host networking for containers.
- Updated CaaS/CI image to openhpc-230110-1629.qcow2.
Full Changelog: v1.124...v1.125
Notes
- By default opendistro uses self-signed certs with 10 years validity. Certs are automatically updated if necessary when running the appliance.
Deployment
- Merge this release:
  - Change any environment-specific elasticsearch_* and opendistro_* variables to appropriate opensearch_* equivalents. Note the defaults in the common environment are changed by this release so only non-defaults require manual action.
  - Remove podman_cidr variable as this is unused for container host networking.
- Update galaxy roles and collections.
- Run site.yml. OpenDistro data in {{ appliances_state_dir | default('/usr/share') }}/elasticsearch/data will be automatically migrated to OpenSearch. Note the OpenDistro data will not automatically be deleted.
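To help with the variable renaming step above, the following is a minimal sketch of a helper that lists any remaining elasticsearch_* or opendistro_* variable names in an environment's YAML files. It is not part of the appliance: the function name find_legacy_vars, the choice to scan only .yml and .yaml files, and the directory passed in are all assumptions made for illustration. It only reports matches (including any in comments); picking the appropriate opensearch_* equivalent remains a manual step.

```python
import re
from pathlib import Path

# Variable names starting with the legacy prefixes named in the release notes.
LEGACY_VAR = re.compile(r"\b(?:elasticsearch|opendistro)_\w+")


def find_legacy_vars(root):
    """Return (file, line number, variable name) tuples for legacy names under root.

    Hypothetical helper: scans .yml/.yaml files only, which is an assumption
    about how environment-specific variables are stored.
    """
    hits = []
    for path in sorted(Path(root).rglob("*")):
        if path.suffix not in {".yml", ".yaml"} or not path.is_file():
            continue
        lines = path.read_text().splitlines()
        for lineno, line in enumerate(lines, start=1):
            for name in LEGACY_VAR.findall(line):
                hits.append((str(path), lineno, name))
    return hits
```

Calling find_legacy_vars on the directory holding a site's environment (the path depends on the site) returns an empty list once no legacy names remain.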
v1.124
v1.123
What's Changed
- Fix RL8.6 and EPEL repo changes (by @sjpb in #245):
- Remove releasever pinning (pinning to 8.6 no longer works as packages have been moved to Vault)
- Bumps the "fat" image for CI appropriately (stackhpc/slurm_image_builder#13)
- Adds an older apptainer package from a GitHub release, to work around openhpc/ohpc#1644 (EPEL package changes break dependencies for ohpc-base-compute package).
Full Changelog: v1.122...v1.123