gmtb-gfsphysics optimization: speedup of ccpp init#48
Conversation
…MP_generic_pre.f90
…calpreciptype.f90
…) derived type so that it gets allocated in the right place in memory (first touch principle): GFS_layer/GFS_driver.F90
…ata_block to avoid reading SDF multiple times, cleanup and formatting changes: IPD_layer/IPD_CCPP_driver.F90
… into gmtb-gfsphysics-optimization
|
Dom - My tests today, and yesterday, are not giving bit-for-bit results, but I'm not sure that they should. The branch I used yesterday (gmtb-fv3-cleanup-and-error-handling) may or may not have had all of the same fixes as this one. Is this a concern worth tracking down? If your tests show bit-for-bit before/after, that's sufficient for me. |
|
@llpcarson Did you create a new baseline or use my baseline? With the changes to the optimization flags in gmtb-gfsphysics PR47, the results have changed (see discussion there) and I did not update the "official" baseline yet. The following one works for me: /scratch4/BMC/gmtb/Dom.Heinzeller/gmtb-fv3/reference/intel/C96fv3gfs2016092900/ |
|
I was just comparing my run from yesterday and mine from today, so I
believe both of them have the PR47 included, but I'll re-confirm
That was it --- the branch I tested yesterday did not have that PR included... so this one is ready to commit!
…On Wed, Mar 14, 2018 at 3:40 PM, Dom Heinzeller ***@***.***> wrote:
@llpcarson <https://github.com/llpcarson> Did you create a new baseline
or use my baseline? With the changes to the optimization flags in
gmtb-gfsphysics PR47, the results have changed (see discussion there) and I
did not update the "official" baseline yet. The following one works for me:
/scratch4/BMC/gmtb/Dom.Heinzeller/gmtb-fv3/reference/
intel/C96fv3gfs2016092900/
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#48 (comment)>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AHTrInvKV9vyNS7RvwbdLOr-vOUYKqghks5teY5MgaJpZM4Sq3qH>
.
|
|
My reference is from straight after pr47 went in, i.e. March 12/13, before the merge this morning and the PRs afterwards. I do get bfb identical results for the C96 test case. But I definitely want to understand why you don't! |
|
My runs today match that baseline, too. So, the branch I grabbed earlier
was pre-PR47 (which was OK for timing, but not for b4b)
…On Wed, Mar 14, 2018 at 3:47 PM, Dom Heinzeller ***@***.***> wrote:
My reference is from straight after pr47 went in, i.e. March 12/13, before
the merge this morning and the PRs afterwards. I do get bfb identical
results for the C96 test case.
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#48 (comment)>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AHTrImwIzphn7EXSgSTDdIO5JbJnZfrJks5teZAOgaJpZM4Sq3qH>
.
|
|
Great, thank you! |
* fv3atm issue NCAR#37: fix the real(8) lat/lon in netcdf file * fv3atm NCAR#35: Reducing background vertical diffusivities in the inversion layers * fv3atm NCAR#24: bug in gfsphysics/physics/moninedmf_hafs.f * fv3atm NCAR#18: Optimize netcdf write component and bugfix for post and samfdeepcnv.f * set (0-1) bounds for ficein_cpl * remove cache_size due to lower netcdf verion 4.5.1 on mars * Change ice falling to 0.9 in gfsphysics/physics/gfdl_cloud_microphys.F90
…r_master Update gsd/develop from NCAR master
This PR improves the performance of the CCPP implementation in FV3 v0 by
This PR also
The results are bit for bit compatible (tested on Theia/Intel and Macbook/GNU with OpenMP enabled).