Skip to content

Update AWS defaults for running obs prep jobs on cloud#3681

Merged
aerorahul merged 10 commits into
NOAA-EMC:developfrom
TaylorRoper-NOAA:feature/prep_jobs_aws
May 14, 2025
Merged

Update AWS defaults for running obs prep jobs on cloud#3681
aerorahul merged 10 commits into
NOAA-EMC:developfrom
TaylorRoper-NOAA:feature/prep_jobs_aws

Conversation

@TaylorRoper-NOAA
Copy link
Copy Markdown
Contributor

@TaylorRoper-NOAA TaylorRoper-NOAA commented May 12, 2025

Description

This PR enables running prep jobs on AWS. This required supporting obsproc and prepobs on the cloud (NOAA-EMC/obsproc#99 and NOAA-EMC/prepobs#43) and deciding on placement of BASE_GIT directory which houses obsproc and prepobs. BASE_GIT is proposed to be /contrib/git on AWS (and other NOAA CSPs), and obsproc and prepobs have been cloned and installed in /contrib/git on the EPIC's AWS /contrib filesystem for future CI testing. The obs data used was made available on AWS via #3494, and the DMPDIR is defaulted to /bucket/dump or user-defined via DATA_MOUNT for AWS. Since restricted data is not yet supported on NOAA CSPs, AWS defaults are updated to include CHGRP_RSTPROD='NO' and MAKE_NSSTBUFR='YES'. Updating CHGRP_RSTPROD is necessary since NSST buffer files are assumed in obsproc to consist of restricted data and are overwritten as empty when chgrp rstprod fails. Allowing making the NSST buffer file on the fly with CHGRP_RSTPROD='NO' was done via NOAA-EMC/obsproc#100.

Resolves #3497

Type of change

  • Bug fix (fixes something broken)
  • New feature (adds functionality)
  • Maintenance (code refactor, clean-up, new CI test, etc.)

Change characteristics

  • Is this a breaking change (a change in existing functionality)? NO
  • Does this change require a documentation update? NO
  • Does this change require an update to any of the following submodules? NO

How has this been tested?

Tested on AWS and Hera by running the C96_atm3DVar CI's starting half cycle and gfs_prep and gdas_prep for first full cycle. On Hera, configuration was updated to MAKE_NSSTBUFR='YES' for comparison. Resulting logs, prepbufr files, and nsstbufr files can be found on Hera at /scratch1/BMC/qosap/Taylor.Roper/shared/prep_tests. The end of the file names indicate whether they are from the cloud or Hera.

Checklist

  • Any dependent changes have been merged and published
  • My code follows the style guidelines of this project
  • I have performed a self-review of my own code
  • I have commented my code, particularly in hard-to-understand areas
  • I have documented my code, including function, input, and output descriptions
  • My changes generate no new warnings
  • New and existing tests pass with my changes
  • This change is covered by an existing CI test or a new one has been added
  • Any new scripts have been added to the .github/CODEOWNERS file with owners
  • I have made corresponding changes to the system documentation if necessary

@TaylorRoper-NOAA TaylorRoper-NOAA changed the title Enable running obs prep jobs on AWS Update AWS defaults for running obs prep jobs on cloud May 13, 2025
Copy link
Copy Markdown
Contributor

@KateFriedman-NOAA KateFriedman-NOAA left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good, thanks @TaylorRoper-NOAA !

Copy link
Copy Markdown
Contributor

@aerorahul aerorahul left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm

@aerorahul aerorahul merged commit bc60ec9 into NOAA-EMC:develop May 14, 2025
5 checks passed
@TaylorRoper-NOAA TaylorRoper-NOAA deleted the feature/prep_jobs_aws branch May 14, 2025 16:32
aerorahul pushed a commit that referenced this pull request May 14, 2025
This PR adds Fit2Obs to `modulefiles/module_base.noaacloud.lua` .
Fit2Obs was updated to support NOAA cloud via
NOAA-EMC/Fit2Obs#35. Fit2Obs is installed in
`/contrib` filesystem at `/contrib/git` on AWS for QOSAP and EPIC. This
is consistent with the choice for `BASE_GIT` made in
#3681.

Resolves #3682
tsga added a commit to tsga/global-workflow that referenced this pull request May 15, 2025
* develop:
  Move parm/config/sfs/config.globus to dev/parm/config/sfs (NOAA-EMC#3697)
  Add Fit2Obs to modulefiles/module_base.noaacloud.lua (NOAA-EMC#3695)
  Update AWS defaults for running obs prep jobs on cloud (NOAA-EMC#3681)
  Add GCAFS forecast-only mode to the workflow
  Adds marine DA ensstat files to archiving (NOAA-EMC#3631)
  Migrate to mercury for globus/hpss transfers from MSU (NOAA-EMC#3655)
  STY: Remove empty __init.py__ files. (NOAA-EMC#3691)
  Relocate config templates to `dev/` space in prep for EE2 (NOAA-EMC#3684)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Enable running gfs and gdas prep jobs for global data assimilation experiments on AWS

3 participants