Skip to content

Add master grib files to GFS HPSS archive for AIGFS#4203

Merged
aerorahul merged 4 commits into
NOAA-EMC:developfrom
CatherineThomas-NOAA:gribarch
Nov 3, 2025
Merged

Add master grib files to GFS HPSS archive for AIGFS#4203
aerorahul merged 4 commits into
NOAA-EMC:developfrom
CatherineThomas-NOAA:gribarch

Conversation

@CatherineThomas-NOAA
Copy link
Copy Markdown
Contributor

@CatherineThomas-NOAA CatherineThomas-NOAA commented Oct 31, 2025

Description

AIGFS currently trains with 0.25 degree grib2 data. For future development, higher resolution data will be used.

The files that are needed from GDAS are the F006 and analysis master grib files (edit to add: F003 as well). This PR updates the HPSS archive list to archive those additional grib files and their associated idx files for the deterministic gdas forecast only.

Resolves #4191

Type of change

  • New feature (adds functionality)

Change characteristics

  • Is this a breaking change (a change in existing functionality)? NO
  • Does this change require a documentation update? NO
  • Does this change require an update to any of the following submodules? NO

How has this been tested?

CI on Gaea-C6 passed. HPSS tarballs were checked for relevant files.

Checklist

  • Any dependent changes have been merged and published
  • My code follows the style guidelines of this project
  • I have performed a self-review of my own code
  • I have commented my code, particularly in hard-to-understand areas
  • I have documented my code, including function, input, and output descriptions
  • My changes generate no new warnings
  • New and existing tests pass with my changes
  • This change is covered by an existing CI test or a new one has been added
  • Any new scripts have been added to the .github/CODEOWNERS file with owners
  • I have made corresponding changes to the system documentation if necessary

junwang-noaa
junwang-noaa previously approved these changes Oct 31, 2025
@junwang-noaa
Copy link
Copy Markdown
Contributor

@CatherineThomas-NOAA I got some feedback on AIGFSv1 that users ask for more frequent output (GFS is hourly, and AIGFS is 6 hourly). May I ask if we can save hourly master file from f000-f006, or at aleast f003 so that we may have 3 hourly data for training? Thanks

@CatherineThomas-NOAA
Copy link
Copy Markdown
Contributor Author

I ran the CI on Gaea C6. The lowres v17 test has the following new entries in the gdas.tar file:

htar -tvf /NCEPDEV/emc-da/1year/Catherine.Thomas/GAEAC6/scratch/C96C48mx500_S2SW_cyc_gfs_gribarch/2021122100/gdas.tar
...
HTAR: -rw-r--r--  Catherine.Thomas/gfs-cpu   18051150 2025-10-31 16:40  gdas.20211221/00/model/atmos/master/gdas.t00z.master.analysis.grib2
HTAR: -rw-r--r--  Catherine.Thomas/gfs-cpu     238674 2025-10-31 16:40  gdas.20211221/00/model/atmos/master/gdas.t00z.master.analysis.grib2.idx
HTAR: -rw-r--r--  Catherine.Thomas/gfs-cpu   20185776 2025-10-31 16:49  gdas.20211221/00/model/atmos/master/gdas.t00z.master.f003.grib2
HTAR: -rw-r--r--  Catherine.Thomas/gfs-cpu   20244475 2025-10-31 16:49  gdas.20211221/00/model/atmos/master/gdas.t00z.master.f006.grib2

@junwang-noaa - Is this sufficient?

@CatherineThomas-NOAA
Copy link
Copy Markdown
Contributor Author

Thanks @junwang-noaa. Taking out of draft.

@CatherineThomas-NOAA CatherineThomas-NOAA marked this pull request as ready for review November 3, 2025 16:37
@CatherineThomas-NOAA
Copy link
Copy Markdown
Contributor Author

CI Tests on Gaea-C6 pass:

SKIP C48_ATM_ecflow on gaeac6
 
/gpfs/f6/gfs-cpu/world-shared/Catherine.Thomas/tmp/RUNTESTS_gribarch/EXPDIR/C48_ATM_gribarch
   CYCLE         STATE           ACTIVATED              DEACTIVATED     
202103231200        Done    Oct 31 2025 14:50:05    Oct 31 2025 15:50:09
 
/gpfs/f6/gfs-cpu/world-shared/Catherine.Thomas/tmp/RUNTESTS_gribarch/EXPDIR/C48mx500_3DVarAOWCDA_gribarch
   CYCLE         STATE           ACTIVATED              DEACTIVATED     
202103241800        Done    Oct 31 2025 14:50:05    Oct 31 2025 15:10:11
202103250000        Done    Oct 31 2025 14:50:05    Oct 31 2025 20:05:08
 
/gpfs/f6/gfs-cpu/world-shared/Catherine.Thomas/tmp/RUNTESTS_gribarch/EXPDIR/C48mx500_hybAOWCDA_gribarch
   CYCLE         STATE           ACTIVATED              DEACTIVATED     
202103241800        Done    Oct 31 2025 14:50:06    Oct 31 2025 15:05:10
202103250000        Done    Oct 31 2025 14:50:06    Oct 31 2025 20:05:09
 
SKIP C48_S2SWA_gefs_RT on gaeac6
 
/gpfs/f6/gfs-cpu/world-shared/Catherine.Thomas/tmp/RUNTESTS_gribarch/EXPDIR/C48_S2SWA_gefs_gribarch
   CYCLE         STATE           ACTIVATED              DEACTIVATED     
202103231200        Done    Oct 31 2025 14:50:05    Oct 31 2025 16:10:21
 
SKIP C48_S2SW_extended on gaeac6
 
/gpfs/f6/gfs-cpu/world-shared/Catherine.Thomas/tmp/RUNTESTS_gribarch/EXPDIR/C48_S2SW_gribarch
   CYCLE         STATE           ACTIVATED              DEACTIVATED     
202103231200        Done    Oct 31 2025 14:50:06    Oct 31 2025 15:55:11
 
SKIP C96_atm3DVar_extended on gaeac6
 
/gpfs/f6/gfs-cpu/world-shared/Catherine.Thomas/tmp/RUNTESTS_gribarch/EXPDIR/C96_atm3DVar_gribarch
   CYCLE         STATE           ACTIVATED              DEACTIVATED     
202112201800        Done    Oct 31 2025 14:50:05    Oct 31 2025 15:10:10
202112210000        Done    Oct 31 2025 14:50:05    Oct 31 2025 20:15:20
202112210600        Done    Oct 31 2025 14:50:05    Oct 31 2025 20:15:20
 
/gpfs/f6/gfs-cpu/world-shared/Catherine.Thomas/tmp/RUNTESTS_gribarch/EXPDIR/C96C48_hybatmDA_gribarch
   CYCLE         STATE           ACTIVATED              DEACTIVATED     
202112201800        Done    Oct 31 2025 14:50:06    Oct 31 2025 15:10:10
202112210000        Done    Oct 31 2025 14:50:06    Oct 31 2025 20:15:17
202112210600        Done    Oct 31 2025 14:50:06    Oct 31 2025 20:55:05
 
/gpfs/f6/gfs-cpu/world-shared/Catherine.Thomas/tmp/RUNTESTS_gribarch/EXPDIR/C96C48_hybatmsnowDA_gribarch
   CYCLE         STATE           ACTIVATED              DEACTIVATED     
202112201200        Done    Oct 31 2025 14:50:05    Oct 31 2025 15:10:11
202112201800        Done    Oct 31 2025 14:50:05    Oct 31 2025 20:15:17
202112210000        Done    Oct 31 2025 14:50:05    Oct 31 2025 20:15:17
 
/gpfs/f6/gfs-cpu/world-shared/Catherine.Thomas/tmp/RUNTESTS_gribarch/EXPDIR/C96C48_hybatmsoilDA_gribarch
   CYCLE         STATE           ACTIVATED              DEACTIVATED     
202205150600        Done    Oct 31 2025 14:50:05    Oct 31 2025 15:15:07
202205151200        Done    Oct 31 2025 14:50:05    Oct 31 2025 20:15:17
202205151800        Done    Oct 31 2025 14:50:05    Oct 31 2025 20:15:17
 
/gpfs/f6/gfs-cpu/world-shared/Catherine.Thomas/tmp/RUNTESTS_gribarch/EXPDIR/C96C48mx500_S2SW_cyc_gfs_gribarch
   CYCLE         STATE           ACTIVATED              DEACTIVATED     
202112201200        Done    Oct 31 2025 14:50:06    Oct 31 2025 15:10:11
202112201800        Done    Oct 31 2025 14:50:06    Oct 31 2025 20:05:08
202112210000        Done    Oct 31 2025 14:50:06    Oct 31 2025 20:05:08
 
SKIP C96C48_ufs_hybatmDA on gaeac6
 
/gpfs/f6/gfs-cpu/world-shared/Catherine.Thomas/tmp/RUNTESTS_gribarch/EXPDIR/C96_gcafs_cycled_noDA_gribarch
   CYCLE         STATE           ACTIVATED              DEACTIVATED     
202112201200        Done    Oct 31 2025 14:50:06    Oct 31 2025 15:15:08
202112201800        Done    Oct 31 2025 14:50:06    Oct 31 2025 16:40:07
202112210000        Done    Oct 31 2025 14:50:06    Oct 31 2025 16:15:07
 
/gpfs/f6/gfs-cpu/world-shared/Catherine.Thomas/tmp/RUNTESTS_gribarch/EXPDIR/C96_gcafs_cycled_gribarch
   CYCLE         STATE           ACTIVATED              DEACTIVATED     
202112201200        Done    Oct 31 2025 14:50:05    Oct 31 2025 15:15:07
202112201800        Done    Oct 31 2025 14:50:05    Oct 31 2025 16:55:08
202112210000        Done    Oct 31 2025 14:50:05    Oct 31 2025 16:30:09
 
/gpfs/f6/gfs-cpu/world-shared/Catherine.Thomas/tmp/RUNTESTS_gribarch/EXPDIR/C96mx100_S2S_gribarch
   CYCLE         STATE           ACTIVATED              DEACTIVATED     
199405010000        Done    Oct 31 2025 14:50:06    Oct 31 2025 16:15:08

Copy link
Copy Markdown
Contributor

@aerorahul aerorahul left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

looks good. Just one question.

Comment thread parm/archive/gdas.yaml.j2
Comment on lines +177 to +178
- "{{ COMIN_ATMOS_MASTER | relpath(ROTDIR) }}/{{ head }}master.f003.grib2"
- "{{ COMIN_ATMOS_MASTER | relpath(ROTDIR) }}/{{ head }}master.f006.grib2"
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should we include the idx files?

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actually, I wonder if we are even generating the index files for these. Probably are not.

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I had thought about this as well. We're at least not saving them to COM, we don't have them in our parallels. I didn't check if there is a silent failure somewhere though.

@WenMeng-NOAA - Can you confirm whether the master grib files should have idx files saved in COM as well?

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The idx files for the master are not created, similar to the sflux files. The sflux idx files are generated in the atmos_products task. I'll start generating the master.idx files if the group wants that, and start including it in COM. The master files are strictly internal to GFS (until now for archiving for AIGFS).

Comment thread parm/archive/gdas.yaml.j2
{% endfor %}


# Forecast Master GRIB2 data for F006
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
# Forecast Master GRIB2 data for F006
# Forecast Master GRIB2 data for F003 and F006

@aerorahul aerorahul merged commit db43b5a into NOAA-EMC:develop Nov 3, 2025
5 checks passed
@CatherineThomas-NOAA CatherineThomas-NOAA deleted the gribarch branch November 3, 2025 17:07
weihuang-jedi added a commit to NOAA-EPIC/global-workflow-cloud that referenced this pull request Nov 5, 2025
…NOAA-EPIC/global-workflow-cloud into feature/use_container_spack-stack-1.9.2

* 'feature/use_container_spack-stack-1.9.2' of github.com:NOAA-EPIC/global-workflow-cloud:
  reverse few changes
  re-sync with EMC repo
  Add master grib files to GFS HPSS archive for AIGFS (NOAA-EMC#4203)
  Update Snow filenames to comply with EE2 (NOAA-EMC#4195)
  Rename files for JEDI atm EE2 (NOAA-EMC#4193)
  Generate `pres_b` files for `RUN=gdas` and update `APCP` to `598`. (NOAA-EMC#4196)
  Update checks for MOM6 restarts when performing a re-run on failure (NOAA-EMC#4179)
  Decrease HPSS storage for GFS retros and address hpss bugs (NOAA-EMC#4184)
  Add noaacloud to ufsda case in dev/ush/load_modules (NOAA-EMC#4198)
  Remove replay from global workflow  (NOAA-EMC#4182)
  Add IODA stats text file to COM (NOAA-EMC#4176)
  Update UFS_UTILS submodule (NOAA-EMC#4178)
  Atm COM File Rename to Standardized Form  (NOAA-EMC#4117)
  Replace cp with cpfs/cpreq for atomic copies to COM directories (NOAA-EMC#4130)
  Create a UPP module for the global workflow (NOAA-EMC#4174)
  Refactor marine DA tasks (NOAA-EMC#4160)
  Delay ocean post-processing trigger to next-next forecast (NOAA-EMC#4167)
  Make options hashes
  Remove multiple option from static data template
  Fix static_data yaml (descriptions and labels)
  Fix static_data yaml (remove colon)
  Add Ursa to and remove C5 from list of HPCs in the bug report template (NOAA-EMC#4164)
  Rename marine (ocean/ice) files following EE2 conventions (NOAA-EMC#4162)
  Add attributes to Gaussian grid sfcanl file (NOAA-EMC#4149)
  Remove the snow analysis from archive (NOAA-EMC#4157)
  Update verif-global to fix pcp failures on special cases (NOAA-EMC#4154)
  Add CRTM fix directory paths to global-workflow (NOAA-EMC#4143)
  Update UFS Model  (NOAA-EMC#4138)
  Add functionality to assimilate the new snow observations (NOAA-EMC#4132)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Archive additional files in the GFS for AIGFS

3 participants