Build ufs_weather_model on Gaea-C5 + update cubed sphere gitmodules for perturbation/increments for cold starts #2271#2269
Conversation
|
Hello..This PR is in support of NOAA-EMC/global-workflow#2535. Please let me know how I should proceed with testing. Since this PR will not affect other platforms, do I need to run RT on Hera/Derecho/Hercules? Thanks for your time! |
|
@DavidBurrows-NCO Develop branch builds and runs ok on gaea-c5. Can you explain why we need the code change of this PR? Any issue porting global workflow to gaea-c5? |
@jkbk2004 Have you looked into @DavidBurrows-NCO changes to see if it works for the UFSWM on Gaea-C5? |
|
|
Sorry for accidentally closing/re-opening. Anyway, @BrianCurtis-NOAA @DavidBurrows-NCO the gaea modulefile of this pr is a lot clean. Let me throw a run on gaea. @RatkoVasic-NOAA can you take a look at this modulefile update for gaea? |
One concerning things to me was the removal of the stack-python as it might cause issues with the abort_dep_tasks.py script. |
|
Good Morning...the previous module file didn't contain a load cmake, so you get the following error upon building: I added the module load cmake then did some clean up as well. The modulefile for gaea now resembles gfs_utils, ufs_utils, and upp. |
|
This changes worked for me. @BrianCurtis-NOAA can you give me example so I can try to get failure with abort_dep_tasks.py |
Mostly with importing python ecflow package in https://github.com/ufs-community/ufs-weather-model/blob/develop/tests/abort_dep_tasks.py#L3. But it sounds like it runs ok. @RatkoVasic-NOAA a decision point on your side is to specifically control thru stack-python. I think its safe to keep moving with the stack-python. @DavidBurrows-NCO can you put the stack-python line back. So we can move on. |
3a9d23d to
db99a8b
Compare
|
@BrianCurtis-NOAA @RatkoVasic-NOAA I added stack-python load back into the module file and tested a build. |
This concern is just based on rt.sh failing with running ECFlow on Gaea. @RatkoVasic-NOAA are you using ecflow to run the rt.sh suite? |
|
@BrianCurtis-NOAA I didn't run full test suite, just about 10 test cases to see if it brakes compilation or some executions. I used rocoto. I can now try with ecflow same thing. |
|
Selected 10 tests worked using both ecflow and rocoto for me. |
OK Great. @jkbk2004 please run the full rt suite on Gaea-C5 using ecflow and we can then merge this PR with another non-baseline changing PR once that's completed. |
Sure! |
|
@DavidBurrows-NCO This rap clm_lake case is crashing with this pr. experiment is at /gpfs/f5/epic/scratch/Jong.Kim/RT_RUNDIRS |
|
@jkbk2004 I am also getting a failure with that test. Here are some highlights from the output: Are you getting the same error? I don't see any other hints in the working directory for a reason for this failure. |
|
I cannot see your directories due to restrictive permissions. What you've shown me so far are errors that have nothing to do with CLM lake. I cannot debug farther until I see the stack trace. |
|
@SamuelTrahanNOAA Can I move the working dir somewhere on Gaea for you? My Gaea groups are epic and ufs-ard. Thanks, Sam! |
I don't know where your working directory resides. You don't necessarily need to move it; I just need the permissions to allow world access. This is not a stack trace:
The stack trace would include a trace of the files and line numbers to the point in the stack with the divide by zero. |
|
@SamuelTrahanNOAA The working dir is here: |
HYCOM-interface has a change but no .gitmodule entry. Please fix. |
HYCOM hash is already committed one on emc/develop. Good to go with hash update. |
on-behalf-of @ufs-community <ecc.platform@noaa.gov>
on-behalf-of @ufs-community <ecc.platform@noaa.gov>
on-behalf-of @ufs-community <ecc.platform@noaa.gov>
|
@jkbk2004 @FernandoAndrade-NOAA I talked to Brian offline. We're good to skip acorn. |
|
Thanks for everyone's help with this PR @jkbk2004 @zach1221 @FernandoAndrade-NOAA @BrianCurtis-NOAA @SamuelTrahanNOAA @RatkoVasic-NOAA @DusanJovic-NOAA @junwang-noaa @weihuang-jedi |
Commit Queue Requirements:
Description:
Commit Message:
Priority:
Git Tracking
UFSWM:
Sub component Pull Requests:
UFSWM Blocking Dependencies:
Changes
Regression Test Changes (Please commit test_changes.list):
Input data Changes:
Library Changes/Upgrades:
Testing Log: