Skip to content

RK2 loop with GPU halo support#48

Merged
marshallward merged 6 commits into
dev/gpufrom
rk-dev-with-gpu
Nov 13, 2025
Merged

RK2 loop with GPU halo support#48
marshallward merged 6 commits into
dev/gpufrom
rk-dev-with-gpu

Conversation

@marshallward
Copy link
Copy Markdown
Owner

Uses the new GPU halo exchange support in the FMS fork.

AFAIK this does all the halo updates on (although only tested on a reentrant 1-GPU job).

This won't go in until everyone is able to build and run with the FMS GPU dev fork.

(Also d69264c was probably supposed to be in the previous PR, but it's fairly minor.)

marshallward and others added 5 commits September 29, 2025 00:21
NVIDIA was correcting for our missing uhtr allocations on the GPU in the
slowest way possible.  This patch adds an explicit allocation, which
reduced the number of transfers dramatically.
ALso:
* Declate [uv]int_cor in MOM_barotropic as loop-locals
* Add h to data transfer after RK2 step
@marshallward
Copy link
Copy Markdown
Owner Author

Also, we should probably not take this in until we've added a config_src/infra which supports both the current and the GPU dev FMS forks.

The default FMS is not compatible with the new GPU-based MPI methods, so
we just change it in .testing/Makefile for now.  Down the road, we
should probably move this into the .github config.

DO_LOCALITY() now returns a ; if do concurrent locality modifiers are
unsupported.  This prevents errors associated with line continuations.
@codecov
Copy link
Copy Markdown

codecov Bot commented Nov 13, 2025

Codecov Report

❌ Patch coverage is 86.95652% with 3 lines in your changes missing coverage. Please review.
✅ Project coverage is 37.68%. Comparing base (4f7c405) to head (5e5fd40).
⚠️ Report is 6 commits behind head on dev/gpu.

Files with missing lines Patch % Lines
src/core/MOM_barotropic.F90 81.81% 1 Missing and 1 partial ⚠️
src/core/MOM_dynamics_split_RK2.F90 90.00% 0 Missing and 1 partial ⚠️
Additional details and impacted files
@@           Coverage Diff            @@
##           dev/gpu      #48   +/-   ##
========================================
  Coverage    37.68%   37.68%           
========================================
  Files          304      304           
  Lines        92475    92474    -1     
  Branches     17741    17741           
========================================
  Hits         34849    34849           
+ Misses       51075    51074    -1     
  Partials      6551     6551           

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

@marshallward
Copy link
Copy Markdown
Owner Author

The CI is now fixed, it now points to the GPU version of FMS.
(Also fixed a bug in the DO_LOCALITY() macro)

@marshallward marshallward merged commit cd178dd into dev/gpu Nov 13, 2025
106 checks passed
@marshallward marshallward mentioned this pull request Nov 14, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants