Multiple domains quilting restart#1722
Conversation
|
Please link FV3 and atmos_cubed_sphere PR's. Please run GNU/Intel RT's and note Tests that change. |
|
@DusanJovic-NOAA please bring these up to date with respective authoritative repositories
|
|
@DusanJovic-NOAA we can start working on this pr. Can you sync up branches? |
Merged. |
|
@DusanJovic-NOAA do you have GNU/Intel RT logs from runs on either Hera or Cheyenne? |
RegressionTests_hera.intel.log This is a hera log from my test I ran yesterday. I do not have gnu log, but since gnu test does not run HAFS I do not expect changes in the baselines. |
|
@BrianCurtis-NOAA I'm going to start testing through this PR next. |
|
Jenkins-ci logs attached. ORTs passed. I will now begin manually creating the new baselines for the below tests. |
|
@jkbk2004 All four of the new Hafs qr cases are failing on cheyenne.intel. |
|
@jkbk2004 I can try to run these cases with hdf5 1.14.0 |
|
It sounds like new cases are running with -DDEBUG=ON. At least not crashing but very slowly: /glade/scratch/jongkim/pr-1722-intel/jongkim/FV3_RT/rt_60395/hafs_regional_1nest_atm_qr. looks like crash is system/compiler/mpt issue. Not practical to run the new cases on cheyenne. @DusanJovic-NOAA @zach1221 can we agree to turn the new cases off on cheyenne? |
I agree, turn off 4 new hafs tests on cheyenne. |
|
@DusanJovic-NOAA Do you have any insight into the exact line it is complaining about? https://github.com/DusanJovic-NOAA/fv3atm/blob/0379fd48f24dd67cab6f8b88b2b77fabfa7afc71/io/module_write_restart_netcdf.F90#L452 |
|
I can create a new issue regarding the hafs qr error on cheyenne.intel. |
I don't. That's the function that actually writes the array into a file. The error message is 'NetCDF: HDF error' , so I assume it's something wrong in the netcdf or hdf5 library. @zach1221 Did you try to run one of these 4 tests using hdf5 1.14.0? |
|
Hi, @DusanJovic-NOAA. Yes, I tried hdf5 1.14.0 with hafs_regional_1nest_atm_qr, but received the same error message posted above. Perhaps it would be worth it to try with netcdf updated to version 4.9.4 as well. |
|
Ok, I'll continue investigating the failure in the UFS-WM issues queue. In the meantime, @DusanJovic-NOAA, if you want to turn off the 4 new hafs tests on cheyenne then I think we're ready to proceed final review/approvals. |
I disabled those 4 tests on cheyenne |
93346f7
|
Apologies I sent the reviews out before the fv3 submodule pointer was updated and gitmodules were reverted. Can you please provide your approval again? @DeniseWorthen @BrianCurtis-NOAA |
Description
This PR updates fv3atm write grid component to allow writing the restart files for multiple domains (nests). Three new tests were added which compare the RESTART files written by the write grid component with restart files written by the FMS
Fixes: #1628
Top of commit queue on: TBD
Input data additions/changes
Anticipated changes to regression tests:
hafs_global_1nest_atm
hafs_global_multiple_4nests_atm
hafs_regional_1nest_atm
hafs_regional_storm_following_1nest_atm
Subcomponents involved:
Combined with PR's (If Applicable):
Commit Queue Checklist:
Linked PR's and Issues:
FV3ATM: #650
GFDL Cubed Sphere: #268
Please link the related issues to be closed with this PR, whether in this repository, or in another repository.
EXAMPLE: Closes NOAA-EMC/fv3atm/issues/<issue_number>
-->
Testing Day Checklist:
Testing Log (for CM's):