Upgrade to spack-stack 1.9.2#3798
Conversation
|
@JessicaMeixner-NOAA @bbakernoaa @aerorahul The The log file can be found here: I am inclined to disable this test on Hercules and open an issue to investigate it. Thoughts? |
|
I agree - however, SFS is using hercules for a lot of their runs, so we should also have @NeilBarton-NOAA and @XiaqiongZhou-NOAA weigh in. |
|
This test should not be deleted. I'll look at the logs |
|
@NeilBarton-NOAA Just clarifying that I'm not proposing that we delete the test. I would just make the following change: --- a/dev/ci/cases/pr/C96mx100_S2S.yaml
+++ b/dev/ci/cases/pr/C96mx100_S2S.yaml
@@ -20,3 +20,4 @@ arguments:
skip_ci_on_hosts:
- gaeac5
- awsepicglobalworkflow
+ - herculesThis would disable the test on Hercules while the issue can be investigated. |
|
SFS routinely runs on hercules. |
|
I wonder if this could be a related issue to the change to the |
|
I have successful runs of S2S from 8/14: /work2/noaa/marine/jmeixner/hercules/gw3798_20250814/t3798/COMROOT/C48_S2SW_t3798 and the C96: /work2/noaa/marine/jmeixner/hercules/gw3798_20250814/t3798/COMROOT/C96mx100_S2S_t3798/logs |
|
@JessicaMeixner-NOAA Could you run these to open up permissions on the log directories? find /work2/noaa/marine/jmeixner/hercules/gw3798_20250814/t3798/COMROOT/* -type d -exec chmod o+rx {} +
find /work2/noaa/marine/jmeixner/hercules/gw3798_20250814/t3798/COMROOT/* -type f -exec chmod o+r {} + |
|
Tagging @EdwardSafford-NOAA for awareness:
|
|
@DavidNew-NOAA @RussTreadon-NOAA I'm running this branch on Ursa, and for C96C48_ufs_hybatmDA atmanlvar and atmensanlobs both run out of memory. See logs here: /scratch3/NCEPDEV/climate/Jessica.Meixner/gwpr3798/ut03/COMROOT/C96C48_ufs_hybatmDA_ut03/logs/2024022400/ |
|
@CoryMartin-NOAA - An out of memory error on Ursa: /scratch3/NCEPDEV/climate/Jessica.Meixner/gwpr3798/ut03/COMROOT/C96_gcafs_cycled_ut03/logs/2021122018/gcdas_aeroanlvar.log.0 @bbakernoaa @lipan-NOAA - I don't quite understand this error, b/c I don't think these jobs actually ran out of walltime, but the gcafs forecasts are failling w/"out of walltime" on Ursa using this branch: |
|
Are we only using one node on Ursa? I see 96 cores. Can we change it to max 48 tasks per node for Ursa for the aeroanl job? |
/scratch3/NCEPDEV/climate/Jessica.Meixner/gwpr3798/ut03/COMROOT/C96_gcafs_cycled_ut03/logs/2021122012/gcdas_fcst_seg0.log looks like it completed. Am I missing something? |
|
I've been informed that the spack-stack issues have been resolved on Hercules and Orion. I'm going to recompile the SFS model and relaunch the forecasts to see if that resolves the issue. |
|
Confirmed that GSI builds on Orion and Hercules now load the correct fortran compiler. Test done with GSI to |
|
I know the issue for the SFS forecasts now. I made a local change to |
|
SFS forecasts have made it to the 12 hour mark without any signs of instability. |
|
great. thanks @DavidHuber-NOAA |
|
All tests passed on Hercules 🎉: |
|
Requesting final approvals for this PR. |
JessicaMeixner-NOAA
left a comment
There was a problem hiding this comment.
Yay!!!!!
@bbakernoaa - thanks for helping get all the gocart model update parts together.
Huge thanks to @DavidHuber-NOAA for all the work on getting all of the spack-stack pieces together!!
|
🎉 |
|
Thank you all for your effort in getting this and several other issues resolved with this PR. |

Description
This updates the global workflow to spack-stack 1.9.2 as well as the subcomponents. This PR also includes a partial port to Ursa, though no global workflow tests have been nor can be run there yet.
Fixes #2984
Fixes #3920
Fixes #3921
Fixes #3922
Fixes #3923
Fixes #3852
Fixes #2756
Fixes #3934
Type of change
Change characteristics
How has this been tested?
Checklist