Skip to content

Update MSU glopara paths to new role-global space#3443

Merged
DavidHuber-NOAA merged 9 commits into
NOAA-EMC:developfrom
KateFriedman-NOAA:feature/role-global-msu
Mar 18, 2025
Merged

Update MSU glopara paths to new role-global space#3443
DavidHuber-NOAA merged 9 commits into
NOAA-EMC:developfrom
KateFriedman-NOAA:feature/role-global-msu

Conversation

@KateFriedman-NOAA
Copy link
Copy Markdown
Contributor

Description

This PR updates the glopara paths for MSU to use the new role-global space for staged support files.

Resolves #3203

Type of change

  • Bug fix (fixes something broken)
  • New feature (adds functionality)
  • Maintenance (code refactor, clean-up, new CI test, etc.)

Change characteristics

Workflow hosts file paths for Orion and Hercules

How has this been tested?

Ran Hercules CI manually

…l-msu

* origin/develop:
  Check if HOMEDIR STMP and PTMP are writable (NOAA-EMC#3430)
  Update UFS_Utils and GFS-utils hashes to update Gaea support and ocean/ice post products (NOAA-EMC#3433)
  Enable C1152 forecasts on gaea C6 (NOAA-EMC#3438)
  Migration to role account for Jenkins on Hercules (NOAA-EMC#3423)
  Remove Direct Linking to COM from DATA for `extractvars` Job (NOAA-EMC#3379)
@KateFriedman-NOAA KateFriedman-NOAA removed the request for review from WalterKolczynski-NOAA March 13, 2025 18:26
@TerrenceMcGuinness-NOAA
Copy link
Copy Markdown
Collaborator

TerrenceMcGuinness-NOAA commented Mar 13, 2025

Looks good. Let's test it with CI on Orion and Hercules simultaneously on the role accounts after PR 3440 goes in.

@TerrenceMcGuinness-NOAA TerrenceMcGuinness-NOAA added CI-Hercules-Ready **CM use only** PR is ready for CI testing on Hercules CI-Gaeac6-Ready **CM use only** PR is ready for CI testing on Gaea C6 CI-Orion-Ready **CM use only** PR is ready for CI testing on Orion labels Mar 14, 2025
@emcbot emcbot added CI-Gaeac6-Building **Bot use only** CI testing is cloning/building on Gaea C6 CI-Hercules-Building **Bot use only** CI testing is cloning/building on Hercules CI-Orion-Building **Bot use only** CI testing is cloning/building on Orion CI-Hercules-Running **Bot use only** CI testing on Hercules for this PR is in-progress CI-Orion-Running **Bot use only** CI testing on Orion for this PR is in-progress and removed CI-Gaeac6-Ready **CM use only** PR is ready for CI testing on Gaea C6 CI-Hercules-Ready **CM use only** PR is ready for CI testing on Hercules CI-Orion-Ready **CM use only** PR is ready for CI testing on Orion CI-Gaeac6-Building **Bot use only** CI testing is cloning/building on Gaea C6 CI-Hercules-Building **Bot use only** CI testing is cloning/building on Hercules CI-Orion-Building **Bot use only** CI testing is cloning/building on Orion labels Mar 14, 2025
@TerrenceMcGuinness-NOAA TerrenceMcGuinness-NOAA added CI-Orion-Failed **Bot use only** CI testing on Orion for this PR has failed CI-Hercules-Failed **Bot use only** CI testing on Hercules for this PR has failed and removed CI-Orion-Running **Bot use only** CI testing on Orion for this PR is in-progress CI-Hercules-Running **Bot use only** CI testing on Hercules for this PR is in-progress labels Mar 14, 2025
@emcbot emcbot added the CI-Orion-Building **Bot use only** CI testing is cloning/building on Orion label Mar 14, 2025
@KateFriedman-NOAA
Copy link
Copy Markdown
Contributor Author

@KateFriedman-NOAA @aerorahul Oh wait, no this is different. role.glopara needs to be added to the list of users with HPSS access on C6. Can one of you make this request?

Is this a helpdesk request? If so, which helpdesk? Or somewhere else?

@emcbot emcbot added CI-Hercules-Failed **Bot use only** CI testing on Hercules for this PR has failed CI-Hercules-Passed **Bot use only** CI testing on Hercules for this PR has completed successfully and removed CI-Hercules-Building **Bot use only** CI testing is cloning/building on Hercules CI-Hercules-Failed **Bot use only** CI testing on Hercules for this PR has failed labels Mar 14, 2025
@DavidHuber-NOAA
Copy link
Copy Markdown
Contributor

I think it is an HPSS request (rdhpcs.hpss.help@noaa.gov).

@emcbot
Copy link
Copy Markdown

emcbot commented Mar 14, 2025

Experiment C48_S2SWA_gefs FAILED on Gaeac6 in Build# 1 with error logs:

/gpfs/f6/drsa-precip3/proj-shared/global/CI/3443/RUNTESTS/COMROOT/C48_S2SWA_gefs_5e585d9f/logs/2021032312/gefs_arch_tars.log

Follow link here to view the contents of the above file(s): (link)

@emcbot
Copy link
Copy Markdown

emcbot commented Mar 14, 2025

Experiment C48_S2SWA_gefs FAILED on Gaeac6 in Build# 1 in
/gpfs/f6/drsa-precip3/proj-shared/global/CI/3443/RUNTESTS/EXPDIR/C48_S2SWA_gefs_5e585d9f

@emcbot
Copy link
Copy Markdown

emcbot commented Mar 14, 2025

CI Failed on Gaeac6 in Build# 1
Built and ran in directory /gpfs/f6/drsa-precip3/proj-shared/global/CI/3443


Experiment C48_ATM_5e585d9f Terminated with 0
FAIL
FAIL tasks failed and 1 dead at Fri 14 Mar 2025 01:37:43 PM EDT
Experiment C48_ATM_5e585d9f Terminated: *FAIL*
Error logs:
/gpfs/f6/drsa-precip3/proj-shared/global/CI/3443/RUNTESTS/COMROOT/C48_ATM_5e585d9f/logs/2021032312/gfs_arch_tars.log
Experiment C96C48_hybatmDA_5e585d9f Terminated with 0
FAIL
FAIL tasks failed and 3 dead at Fri 14 Mar 2025 01:42:59 PM EDT
Experiment C96C48_hybatmDA_5e585d9f Terminated: *FAIL*
Error logs:
/gpfs/f6/drsa-precip3/proj-shared/global/CI/3443/RUNTESTS/COMROOT/C96C48_hybatmDA_5e585d9f/logs/2021122100/enkfgdas_earc_tars_00.log
/gpfs/f6/drsa-precip3/proj-shared/global/CI/3443/RUNTESTS/COMROOT/C96C48_hybatmDA_5e585d9f/logs/2021122100/enkfgdas_earc_tars_01.log
/gpfs/f6/drsa-precip3/proj-shared/global/CI/3443/RUNTESTS/COMROOT/C96C48_hybatmDA_5e585d9f/logs/2021122100/gdas_arch_tars.log
Experiment C96_atm3DVar_5e585d9f Terminated with 0
FAIL
FAIL tasks failed and 1 dead at Fri 14 Mar 2025 01:49:37 PM EDT
Experiment C96_atm3DVar_5e585d9f Terminated: *FAIL*
Error logs:
/gpfs/f6/drsa-precip3/proj-shared/global/CI/3443/RUNTESTS/COMROOT/C96_atm3DVar_5e585d9f/logs/2021122100/gdas_arch_tars.log
Experiment C48_S2SW_5e585d9f Terminated with 0
FAIL
FAIL tasks failed and 1 dead at Fri 14 Mar 2025 01:55:33 PM EDT
Experiment C48_S2SW_5e585d9f Terminated: *FAIL*
Error logs:
/gpfs/f6/drsa-precip3/proj-shared/global/CI/3443/RUNTESTS/COMROOT/C48_S2SW_5e585d9f/logs/2021032312/gfs_arch_tars.log
Experiment C48_S2SWA_gefs_5e585d9f Terminated with 0
FAIL
FAIL tasks failed and 1 dead at Fri 14 Mar 2025 02:34:56 PM EDT
Experiment C48_S2SWA_gefs_5e585d9f Terminated: *FAIL*
Error logs:
/gpfs/f6/drsa-precip3/proj-shared/global/CI/3443/RUNTESTS/COMROOT/C48_S2SWA_gefs_5e585d9f/logs/2021032312/gefs_arch_tars.log

@KateFriedman-NOAA
Copy link
Copy Markdown
Contributor Author

KateFriedman-NOAA commented Mar 14, 2025

I think it is an HPSS request (rdhpcs.hpss.help@noaa.gov).

Ticket submitted! RDHPCS#2025031454000197

@emcbot
Copy link
Copy Markdown

emcbot commented Mar 14, 2025

Experiment C48_ATM FAILED on Hercules in Build# 10 with error logs:

/work2/noaa/global/role-global/GFS_CI_CD/HERCULES/CI/3443/RUNTESTS/COMROOT/C48_ATM_5e585d9f/logs/2021032312/gfs_atmos_prod_f048-f051.log

Follow link here to view the contents of the above file(s): (link)

@emcbot
Copy link
Copy Markdown

emcbot commented Mar 14, 2025

Experiment C48_ATM FAILED on Hercules in Build# 10 in
/work2/noaa/global/role-global/GFS_CI_CD/HERCULES/CI/3443/RUNTESTS/EXPDIR/C48_ATM_5e585d9f

@emcbot
Copy link
Copy Markdown

emcbot commented Mar 14, 2025

Experiment C48_ATM FAILED on Orion in Build# 12 with error logs:

/work2/noaa/global/role-global/GFS_CI_CD/ORION/CI/3443/RUNTESTS/COMROOT/C48_ATM_30e91283/logs/2021032312/gfs_atmos_prod_f048-f051.log

Follow link here to view the contents of the above file(s): (link)

@emcbot
Copy link
Copy Markdown

emcbot commented Mar 14, 2025

Experiment C48_ATM FAILED on Orion in Build# 12 in
/work2/noaa/global/role-global/GFS_CI_CD/ORION/CI/3443/RUNTESTS/EXPDIR/C48_ATM_30e91283

@emcbot
Copy link
Copy Markdown

emcbot commented Mar 14, 2025

Experiment C96mx100_S2S FAILED on Orion in Build# 12 with error logs:

/work2/noaa/global/role-global/GFS_CI_CD/ORION/CI/3443/RUNTESTS/COMROOT/C96mx100_S2S_30e91283/logs/1994050100/sfs_atmos_prod_mem000_f024.log

Follow link here to view the contents of the above file(s): (link)

@emcbot
Copy link
Copy Markdown

emcbot commented Mar 14, 2025

Experiment C96mx100_S2S FAILED on Orion in Build# 12 in
/work2/noaa/global/role-global/GFS_CI_CD/ORION/CI/3443/RUNTESTS/EXPDIR/C96mx100_S2S_30e91283

@TerrenceMcGuinness-NOAA
Copy link
Copy Markdown
Collaborator

TerrenceMcGuinness-NOAA commented Mar 14, 2025

@KateFriedman-NOAA Following the links to error logs there are some grib files missing in the atmos model on both Orion and Hercules.

*** FATAL ERROR: missing input file /work2/noaa/global/role-global/GFS_CI_CD/HERCULES/CI/3443/RUNTESTS/COMROOT/C48_ATM_5e585d9f/gfs.20210323/12//model/atmos/master/gfs.t12z.master.grb2f048 ***

Some tests are passing. I'll list out the passed tests to contrast with these fails next week.

@DavidHuber-NOAA
Copy link
Copy Markdown
Contributor

DavidHuber-NOAA commented Mar 17, 2025

@TerrenceMcGuinness-NOAA When this error occurs during CI, it usually indicates that one of the tests started over but DATAROOT was not deleted first. This tricks the forecast into thinking that it needs to restart, but the post jobs will fail due to missing data. There are two solutions: 1) delete DATAROOT before starting CI again or 2) update the branch. The latter will work because the CI tests will be from a new hash, meaning the DATAROOT directory will be different. Issue #3071 was opened to address this.

@emcbot
Copy link
Copy Markdown

emcbot commented Mar 17, 2025

Experiment C96C48_hybatmDA FAILED on Orion in Build# 14 with error logs:

/work2/noaa/global/role-global/GFS_CI_CD/ORION/CI/3443/RUNTESTS/COMROOT/C96C48_hybatmDA_30e91283/logs/2021122100/gfs_atmos_prod_f117.log
/work2/noaa/global/role-global/GFS_CI_CD/ORION/CI/3443/RUNTESTS/COMROOT/C96C48_hybatmDA_30e91283/logs/2021122100/gfs_atmos_prod_f120.log

Follow link here to view the contents of the above file(s): (link)

@emcbot
Copy link
Copy Markdown

emcbot commented Mar 17, 2025

Experiment C96C48_hybatmDA FAILED on Orion in Build# 14 in
/work2/noaa/global/role-global/GFS_CI_CD/ORION/CI/3443/RUNTESTS/EXPDIR/C96C48_hybatmDA_30e91283

@TerrenceMcGuinness-NOAA
Copy link
Copy Markdown
Collaborator

TerrenceMcGuinness-NOAA commented Mar 17, 2025

/work2 has become unavailable on both Orion and Hercules. We are working to find the problem and get it resolved as quickly as possible. We will send an update as we learn more.

For any associated problems, please submit a help desk ticket.
HPC2 users email: help@hpc.msstate.edu

Joey B. Jones

@emcbot
Copy link
Copy Markdown

emcbot commented Mar 17, 2025

CI Failed on Orion in Build# 14
Built and ran in directory /work2/noaa/global/role-global/GFS_CI_CD/ORION/CI/3443


Experiment C48_ATM_30e91283 Completed 1 Cycles: *SUCCESS* at Mon Mar 17 01:43:23 PM CDT 2025
Experiment C96mx100_S2S_30e91283 Completed 1 Cycles: *SUCCESS* at Mon Mar 17 02:20:04 PM CDT 2025
Experiment C96C48_hybatmDA_30e91283 Terminated with 0
FAIL
FAIL tasks failed and 2 dead at Mon Mar 17 02:38:38 PM CDT 2025
Experiment C96C48_hybatmDA_30e91283 Terminated: *FAIL*
Error logs:
/work2/noaa/global/role-global/GFS_CI_CD/ORION/CI/3443/RUNTESTS/COMROOT/C96C48_hybatmDA_30e91283/logs/2021122100/gfs_atmos_prod_f117.log
/work2/noaa/global/role-global/GFS_CI_CD/ORION/CI/3443/RUNTESTS/COMROOT/C96C48_hybatmDA_30e91283/logs/2021122100/gfs_atmos_prod_f120.log
Experiment C96_atm3DVar_30e91283 Completed 3 Cycles: *SUCCESS* at Mon Mar 17 04:11:37 PM CDT 2025
Experiment C48_S2SWA_gefs_30e91283 Completed 1 Cycles: *SUCCESS* at Mon Mar 17 04:36:11 PM CDT 2025
Experiment C48_S2SW_30e91283 Completed 1 Cycles: *SUCCESS* at Mon Mar 17 06:07:19 PM CDT 2025

@emcbot
Copy link
Copy Markdown

emcbot commented Mar 17, 2025

CI Passed on Hercules in Build# 13
Built and ran in directory /work2/noaa/global/role-global/GFS_CI_CD/HERCULES/CI/3443


Experiment C48_ATM_30e91283 Completed 1 Cycles: *SUCCESS* at Mon Mar 17 13:21:51 CDT 2025
Experiment C48mx500_hybAOWCDA_30e91283 Completed 2 Cycles: *SUCCESS* at Mon Mar 17 13:52:15 CDT 2025
Experiment C96mx100_S2S_30e91283 Completed 1 Cycles: *SUCCESS* at Mon Mar 17 14:04:13 CDT 2025
Experiment C96_atm3DVar_30e91283 Completed 3 Cycles: *SUCCESS* at Mon Mar 17 15:47:02 CDT 2025
Experiment C96C48_hybatmDA_30e91283 Completed 3 Cycles: *SUCCESS* at Mon Mar 17 15:47:02 CDT 2025
Experiment C48mx500_3DVarAOWCDA_30e91283 Completed 2 Cycles: *SUCCESS* at Mon Mar 17 17:23:48 CDT 2025
Experiment C48_S2SWA_gefs_30e91283 Completed 1 Cycles: *SUCCESS* at Mon Mar 17 18:12:25 CDT 2025
Experiment C48_S2SW_30e91283 Completed 1 Cycles: *SUCCESS* at Mon Mar 17 18:54:38 CDT 2025

@emcbot
Copy link
Copy Markdown

emcbot commented Mar 18, 2025

CI Passed on Orion in Build# 15
Built and ran in directory /work2/noaa/global/role-global/GFS_CI_CD/ORION/CI/3443


Experiment C48_ATM_30e91283 Completed 1 Cycles: *SUCCESS* at Tue Mar 18 05:57:45 AM CDT 2025
Experiment C96mx100_S2S_30e91283 Completed 1 Cycles: *SUCCESS* at Tue Mar 18 06:34:20 AM CDT 2025
Experiment C96C48_hybatmDA_30e91283 Completed 3 Cycles: *SUCCESS* at Tue Mar 18 07:35:51 AM CDT 2025
Experiment C96_atm3DVar_30e91283 Completed 3 Cycles: *SUCCESS* at Tue Mar 18 07:41:56 AM CDT 2025
Experiment C48_S2SW_30e91283 Completed 1 Cycles: *SUCCESS* at Tue Mar 18 07:47:55 AM CDT 2025
Experiment C48_S2SWA_gefs_30e91283 Completed 1 Cycles: *SUCCESS* at Tue Mar 18 08:07:04 AM CDT 2025

Copy link
Copy Markdown
Contributor

@DavidHuber-NOAA DavidHuber-NOAA left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CI-Hercules-Passed **Bot use only** CI testing on Hercules for this PR has completed successfully CI-Orion-Passed **Bot use only** CI testing on Orion for this PR has completed successfully

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Establish global role account on MSU

4 participants