Fix atmaero resources and turn test on for cheyenne.intel (replaces PR #1273) #1291

Merged
jkbk2004 merged 203 commits into ufs-community:develop from DeniseWorthen:feature/atmaero_resources2 on Jun 27, 2022

DeniseWorthen (Collaborator) commented Jun 24, 2022

PR Checklist

  • This PR is up-to-date with the top of all sub-component repositories except for those sub-components which are the subject of this PR. Please consult the ufs-weather-model wiki if you are unsure how to do this.

  • This PR has been tested using a branch which is up-to-date with the top of all sub-component repositories except for those sub-components which are the subject of this PR

  • An Issue describing the work contained in this PR has been created either in the subcomponent(s) or in the ufs-weather-model. The Issue should be created in the repository that is most relevant to the changes contained in the PR. The Issue and the dependent sub-component PR are specified below.

  • Results for one or more of the regression tests change and the reasons for the changes are understood and explained below.

  • New or updated input data is required by this PR. If checked, please work with the code managers to update input data sets on all platforms.

Instructions: All subsequent sections of text should be filled in as appropriate.

The information provided below allows the code managers to understand the changes relevant to this PR, whether those changes are in the ufs-weather-model repository or in a subcomponent repository. Ufs-weather-model code managers will use the information provided to add any applicable labels, assign reviewers and place it in the Commit Queue. Once the PR is in the Commit Queue, it is the PR owner's responsibility to keep the PR up-to-date with the develop branch of ufs-weather-model.

Description

Creates a set of default resources for the atmaero_control_p8 test and turns the test back on for cheyenne.intel.

Issue(s) addressed

Testing

The new default resources were tested on hera.intel. The test reproduced the existing baseline.

NOTE: A new baseline for the atmaero_control_p8 test will need to be created on cheyenne.intel. All other platforms where the test exists will reproduce the current baseline.

  • hera.intel
  • hera.gnu
  • orion.intel
  • cheyenne.intel
  • cheyenne.gnu
  • gaea.intel
  • jet.intel
  • wcoss2.intel
  • wcoss_cray
  • wcoss_dell_p3
  • opnReqTest for newly added/changed feature
  • CI

Dependencies

None; script level changes only

DeniseWorthen and others added 30 commits March 27, 2021 12:30
This reverts commit 7b826d4.
on-behalf-of @ufs-community <brian.curtis@noaa.gov>
@BrianCurtis-NOAA (Collaborator)

Automated RT Failure Notification
Machine: jet
Compiler: intel
Job: RT
[RT] Repo location: /lfs4/HFIP/h-nems/emc.nemspara/autort/pr/978368110/20220625024509/ufs-weather-model
Please make changes and add the following label back: jet-intel-RT

@BrianCurtis-NOAA (Collaborator)

Automated RT Failure Notification
Machine: jet
Compiler: intel
Job: RT
[RT] Repo location: /lfs4/HFIP/h-nems/emc.nemspara/autort/pr/978368110/20220625164513/ufs-weather-model
[RT] Error: Test hafs_regional_atm 102 failed in run_test failed
[RT] Error: Test hafs_regional_atm_thompson_gfdlsf 103 failed in run_test failed
[RT] Error: Test hafs_regional_atm_ocn 104 failed in run_test failed
[RT] Error: Test hafs_regional_atm_wav 105 failed in run_test failed
[RT] Error: Test hafs_regional_atm_ocn_wav 106 failed in run_test failed
[RT] Error: Test hafs_regional_docn 107 failed in run_test failed
[RT] Error: Test hafs_regional_docn_oisst 108 failed in run_test failed
[RT] Error: Test control_atmwav 123 failed in run_test failed
[RT] Error: Test atmaero_control_p8 124 failed in run_test failed
Please make changes and add the following label back: jet-intel-RT
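
When a notification like the one above lists many failures, it can help to pull the test names and numbers out programmatically. The following is a minimal sketch, not part of the ufs-weather-model tooling: it assumes each failure line has the form `[RT] Error: Test <name> <number> failed ...` as quoted above, and the function name `failed_tests` is hypothetical.

```python
import re

# Line shape assumed from the notification text quoted above:
# "[RT] Error: Test <name> <number> failed in run_test failed"
RT_ERROR = re.compile(r"^\[RT\] Error: Test (?P<name>\S+) (?P<number>\d+) failed")


def failed_tests(notification):
    """Return (test number, test name) pairs for every [RT] Error line."""
    results = []
    for line in notification.splitlines():
        match = RT_ERROR.match(line.strip())
        if match:
            results.append((int(match.group("number")), match.group("name")))
    return results


# A few lines copied from the jet.intel notification above.
sample = """\
[RT] Error: Test hafs_regional_atm 102 failed in run_test failed
[RT] Error: Test control_atmwav 123 failed in run_test failed
[RT] Error: Test atmaero_control_p8 124 failed in run_test failed
Please make changes and add the following label back: jet-intel-RT
"""

for number, name in failed_tests(sample):
    print(f"{number:>4}  {name}")
```

Lines that do not match the assumed shape, such as the closing "Please make changes" line, are skipped rather than treated as errors.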

denise worthen and others added 2 commits June 26, 2022 14:15
DeniseWorthen (Collaborator, Author) commented Jun 26, 2022

On wcoss-dell-p3, the hafs_regional_1nest_atm test has hung right at the beginning three times.

20220626 150903.866 INFO             PET000 af FieldBundleRegridStore
20220626 150903.866 INFO             PET000 ... returned from wrtFB(05,01) FieldBundleRegridStore().
20220626 150903.866 INFO             PET000 bf FieldBundleRegridStore
20220626 150903.866 INFO             PET000 calling into wrtFB(06,01) FieldBundleRegridStore()....

and eventually in the log file:

TERM_RUNLIMIT: job killed after reaching LSF run time limit.
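
A hang of this kind can be confirmed from the two logs quoted above: the PET log stops advancing and the job log carries the `TERM_RUNLIMIT` marker. The sketch below is illustrative only. It assumes PET log lines have the shape `YYYYMMDD HHMMSS.mmm LEVEL PETnnn message` as in the excerpt, and the helper names `last_pet_entry` and `hit_run_limit` are hypothetical.

```python
import re
from datetime import datetime

# Line shape assumed from the PET log excerpt above:
# "20220626 150903.866 INFO             PET000 <message>"
PET_LINE = re.compile(r"^(\d{8}) (\d{6})\.(\d{3})\s+(\w+)\s+(PET\d+)\s+(.*)$")


def last_pet_entry(pet_log):
    """Return (timestamp, PET id, message) of the last parsable line, or None."""
    last = None
    for line in pet_log.splitlines():
        match = PET_LINE.match(line.strip())
        if match:
            date, clock, millis, _level, pet, message = match.groups()
            stamp = datetime.strptime(date + clock, "%Y%m%d%H%M%S")
            stamp = stamp.replace(microsecond=int(millis) * 1000)
            last = (stamp, pet, message)
    return last


def hit_run_limit(job_log):
    """True if the job log contains the LSF run-limit marker quoted above."""
    return "TERM_RUNLIMIT" in job_log


pet_sample = """\
20220626 150903.866 INFO             PET000 bf FieldBundleRegridStore
20220626 150903.866 INFO             PET000 calling into wrtFB(06,01) FieldBundleRegridStore()....
"""
job_sample = "TERM_RUNLIMIT: job killed after reaching LSF run time limit."

entry = last_pet_entry(pet_sample)
if entry is not None and hit_run_limit(job_sample):
    stamp, pet, message = entry
    print(f"{pet} last logged at {stamp.isoformat()}: {message}")
```

The last message logged before the kill points at the call in which the run stalled, here the sixth `FieldBundleRegridStore` call.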

Comment thread tests/default_vars.sh Outdated
@DeniseWorthen (Collaborator, Author)

@jkbk2004 You made the fix in default_vars to run the test on jet, but did not push it?

@jkbk2004 (Collaborator)

> @jkbk2004 You made the fix in default_vars to run the test on jet, but did not push it?

@DeniseWorthen I will push and then merge.

@jkbk2004 jkbk2004 merged commit 9fd9d1c into ufs-community:develop Jun 27, 2022
@DeniseWorthen DeniseWorthen deleted the feature/atmaero_resources2 branch June 29, 2022 13:47

Labels

Baseline Updates: Current baselines will be updated.
Waiting for Reviews: The PR is waiting for reviews from associated component PR's.

Development

Successfully merging this pull request may close these issues.

Adjust resources for atmaero_control_p8; turn test on for cheyenne.intel

3 participants