diagnostics kinda clean by JorgeG94 · Pull Request #113 · marshallward/MOM6

JorgeG94 · 2026-03-25T00:37:34Z

No description provided.

edoyango · 2026-03-25T21:13:21Z

+  ! endif

- if (present(net_err)) net_err = uh_err
+  ! GPU: present() not supported on GPU — net_err is never passed from diag remap callers


What happens for you when using present()? I've used it before in procedures in GPU code when trying different porting strategies for continuity.

Did you try it inside a routine that is declared as !$omp declare target? I was segfaulting.

Huh, maybe my issue was something else.

I think it was a two-fold issue, with something else going wrong as well.
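A small standalone reproducer might help settle whether present() is really the culprit inside a declare-target routine. The sketch below is only an illustration: the module, the routine name accumulate_err, and the array names are hypothetical and not taken from MOM6. It mirrors the net_err pattern by calling the same device routine with and without the optional argument.

```fortran
module present_test_mod
  implicit none
  private
  public :: accumulate_err

contains

  !> Hypothetical device routine with an optional output, mirroring the
  !! "if (present(net_err)) net_err = uh_err" pattern from the diff.
  subroutine accumulate_err(uh_err, net_err)
    !$omp declare target
    real, intent(in)            :: uh_err   !< Error value computed for this column
    real, optional, intent(out) :: net_err  !< Only written when the caller passes it

    if (present(net_err)) net_err = uh_err
  end subroutine accumulate_err

end module present_test_mod

program present_test
  use present_test_mod, only : accumulate_err
  implicit none
  integer, parameter :: n = 8
  real :: err_in(n), err_out(n)
  integer :: i

  do i = 1, n
    err_in(i) = 0.5 * real(i)
  enddo
  err_out(:) = -1.0   ! Sentinel: entries stay at -1 where net_err is not passed

  !$omp target teams distribute parallel do map(to: err_in) map(tofrom: err_out)
  do i = 1, n
    if (mod(i, 2) == 0) then
      call accumulate_err(err_in(i), err_out(i))   ! optional argument present
    else
      call accumulate_err(err_in(i))               ! optional argument absent
    endif
  enddo

  print *, err_out
end program present_test
```

If this runs cleanly with offloading enabled, the segfault in the PR more likely came from something other than present() itself.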

edoyango · 2026-03-25T21:16:21Z

+!> Maximum number of vertical levels supported for GPU-resident local arrays.
+!! Variable-length locals inside declare-target routines force nvfortran to use
+!! GPU heap (NVCOMPILER_ACC_CUDA_HEAPSIZE). Fixed-size arrays use stack instead.
+integer, parameter, public :: NK_GPU_MAX = 500


An alternative is to create the temporary arrays outside the routine (making them private if necessary) and pass them in. This gets around the heap issue.

Not yet sure if that's useful here, but mentioning it in case we need it.
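A minimal sketch of that alternative, assuming a column routine that needs one scratch array: the caller owns the scratch space and marks it private on the offloaded loop, so the declare-target routine has no variable-length locals. The routine column_mean_anomaly and all variable names here are hypothetical, not MOM6 code.

```fortran
module column_work_mod
  implicit none
  private
  public :: column_mean_anomaly

contains

  !> Hypothetical column routine: the caller supplies the scratch array tmp,
  !! so nothing of variable length is allocated inside the routine.
  subroutine column_mean_anomaly(nk, h, tmp, anom_max)
    !$omp declare target
    integer, intent(in)    :: nk        !< Number of levels in this column
    real,    intent(in)    :: h(nk)     !< Input column
    real,    intent(inout) :: tmp(nk)   !< Caller-provided scratch space
    real,    intent(out)   :: anom_max  !< Largest absolute anomaly from the column mean
    integer :: k
    real :: mean

    mean = 0.0
    do k = 1, nk
      mean = mean + h(k)
    enddo
    mean = mean / real(nk)

    anom_max = 0.0
    do k = 1, nk
      tmp(k) = h(k) - mean
      anom_max = max(anom_max, abs(tmp(k)))
    enddo
  end subroutine column_mean_anomaly

end module column_work_mod

program scratch_demo
  use column_work_mod, only : column_mean_anomaly
  implicit none
  integer, parameter :: ni = 4, nk = 3
  real :: h(nk, ni), tmp(nk), amax(ni)
  integer :: i, k

  do i = 1, ni
    do k = 1, nk
      h(k, i) = real(i * k)
    enddo
  enddo

  ! tmp is declared once by the caller and made private per thread.
  !$omp target teams distribute parallel do private(tmp) map(to: h) map(from: amax)
  do i = 1, ni
    call column_mean_anomaly(nk, h(:, i), tmp, amax(i))
  enddo

  print *, amax
end program scratch_demo
```

Compared with a fixed bound such as NK_GPU_MAX, this keeps the routine independent of a compile-time maximum, at the cost of a longer argument list.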

edoyango · 2026-03-25T21:18:48Z

  ! Local variables
  integer :: c, nk, i, j, k
  type(axes_grp), pointer :: axes => NULL(), h_axes => NULL() ! Current axes, for convenience
+  ! Local pointer aliases to avoid derived-type components in OpenMP map clauses


Was this done for convenience, or was there a real limitation?

I was having issues mapping the underlying structures: I could get it to compile, but at runtime they would not be mapped on the GPU. This was a convenient workaround.
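The workaround described here can be sketched as follows, assuming a derived type with an allocatable array component. The type axes_like and the names h_p, ni, nj, nk are hypothetical stand-ins rather than the actual axes_grp layout. A local pointer is associated with the component, and only the pointer appears in the map clause.

```fortran
program alias_map_demo
  implicit none

  !> Hypothetical stand-in for a derived type holding grid data.
  type :: axes_like
    real, allocatable :: h(:,:,:)
  end type axes_like

  integer, parameter :: ni = 4, nj = 3, nk = 2
  type(axes_like), target :: axes
  real, pointer :: h_p(:,:,:) => null()  ! Local alias used in the map clause
  integer :: i, j, k

  allocate(axes%h(ni, nj, nk))
  axes%h(:,:,:) = 1.0

  ! Alias the component so the map clause names a plain array pointer
  ! rather than a derived-type component.
  h_p => axes%h

  !$omp target teams distribute parallel do collapse(3) map(tofrom: h_p)
  do k = 1, nk
    do j = 1, nj
      do i = 1, ni
        h_p(i, j, k) = 2.0 * h_p(i, j, k)
      enddo
    enddo
  enddo

  ! The update is visible through the original component after the region.
  print *, sum(axes%h)
end program alias_map_demo
```

Whether mapping the component directly also works likely depends on the compiler and OpenMP version, so the alias is best read as a portability workaround rather than a requirement.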

edoyango · 2026-03-25T21:33:08Z

+  !   call CS%reconstruction%reconstruct(h0, u0)
+  !   call CS%reconstruction%remap_to_sub_grid(h0, u0, n1, h_sub, &
+  !                                            isrc_start, isrc_end, isrc_max, isub_src, &
+  !                                            u_sub, uh_sub, u02_err)


Are these two methods dispatched by class? Since this routine seems to operate column-wise, with the i,j loops outside, could it be easier to move the i,j loops inside the class-dispatched routines?

Hmm, could be a nice idea. Let me look at how big of a change this might be.

I think this would add a lot of boilerplate, right? We'd need a GPU implementation for mapping.
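To make the trade-off concrete, here is a rough sketch of what moving the i,j loops inside a class-dispatched routine could look like. Everything in it is hypothetical: recon_base, recon_pcm, and reconstruct_all are illustrative names, and the piecewise-constant copy stands in for a real reconstruction. The type-bound call is resolved once on the host, and the offloaded loops live inside the concrete implementation.

```fortran
module recon_mod
  implicit none
  private
  public :: recon_base, recon_pcm

  !> Hypothetical abstract reconstruction scheme.
  type, abstract :: recon_base
  contains
    procedure(reconstruct_all_if), deferred :: reconstruct_all
  end type recon_base

  abstract interface
    subroutine reconstruct_all_if(this, ni, nj, nk, u, u_out)
      import :: recon_base
      class(recon_base), intent(in) :: this
      integer, intent(in)  :: ni, nj, nk
      real,    intent(in)  :: u(ni, nj, nk)
      real,    intent(out) :: u_out(ni, nj, nk)
    end subroutine reconstruct_all_if
  end interface

  !> Hypothetical piecewise-constant scheme.
  type, extends(recon_base) :: recon_pcm
  contains
    procedure :: reconstruct_all => pcm_reconstruct_all
  end type recon_pcm

contains

  !> The i,j loops sit inside the implementation, so no dynamic dispatch
  !! happens within the offloaded region.
  subroutine pcm_reconstruct_all(this, ni, nj, nk, u, u_out)
    class(recon_pcm), intent(in) :: this
    integer, intent(in)  :: ni, nj, nk
    real,    intent(in)  :: u(ni, nj, nk)
    real,    intent(out) :: u_out(ni, nj, nk)
    integer :: i, j, k

    !$omp target teams distribute parallel do collapse(2) map(to: u) map(from: u_out)
    do j = 1, nj
      do i = 1, ni
        do k = 1, nk
          u_out(i, j, k) = u(i, j, k)  ! Piecewise constant: cell value is the mean
        enddo
      enddo
    enddo
  end subroutine pcm_reconstruct_all

end module recon_mod

program dispatch_demo
  use recon_mod, only : recon_base, recon_pcm
  implicit none
  integer, parameter :: ni = 4, nj = 3, nk = 2
  class(recon_base), allocatable :: recon
  real :: u(ni, nj, nk), u_out(ni, nj, nk)

  u(:,:,:) = 1.5
  allocate(recon_pcm :: recon)

  ! Dispatch is resolved here, on the host, once per call.
  call recon%reconstruct_all(ni, nj, nk, u, u_out)

  print *, sum(u_out)
end program dispatch_demo
```

The boilerplate concern raised above shows up directly: each concrete scheme would need its own copy of the loop nest and map clauses.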

marshallward · 2026-04-22T20:31:56Z

Is this now being handled in #153? Can we close this one?

diagnostics kinda clean

c5810df

edoyango reviewed Mar 25, 2026

JorgeG94 closed this Apr 22, 2026

JorgeG94 wants to merge 1 commit into dev/gpu from jorge/diagnostics_port
