Skip to content

Add flexible controls of GPU configuration and remove old GPU configuration#97

Merged
jedwards4b merged 27 commits into
ESMCI:mainfrom
sjsprecious:add_gpu_gust
Aug 9, 2023
Merged

Add flexible controls of GPU configuration and remove old GPU configuration#97
jedwards4b merged 27 commits into
ESMCI:mainfrom
sjsprecious:add_gpu_gust

Conversation

@sjsprecious
Copy link
Copy Markdown
Collaborator

@sjsprecious sjsprecious commented Apr 17, 2023

This PR:

  • updates the configurations for the GPU offload based on the new XML variables in CMEPS (Add new XML variables for flexible controls of GPU configuration ESCOMP/CMEPS#363).
  • removes the old approaches for GPU configurations.
  • updates the module versions and batch script template on Derecho.
  • removes #PBS -V option as it incorrectly populate the NGPUS env variable from the login node to the compute node on Derecho.

Use nvhpc/22.11, cray-mpich/8.1.21, and esmf/8.4.1.b02

	modified:   machines/cmake_macros/gust.cmake
	modified:   machines/config_machines.xml
…_config_cesm/compare/52c06b3..fbc05d6

Update settings on Gust and Casper

	modified:   Depends.nvhpc
	deleted:    Depends.nvhpc-gpu
	deleted:    cmake_macros/nvhpc-gpu.cmake
	deleted:    cmake_macros/nvhpc-gpu_casper.cmake
	modified:   cmake_macros/nvhpc.cmake
	modified:   cmake_macros/nvhpc_casper.cmake
	deleted:    cmake_macros/pgi-gpu.cmake
	deleted:    cmake_macros/pgi-gpu_casper.cmake
	modified:   config_batch.xml
	modified:   config_machines.xml
	deleted:    mpi_run_gpu.casper
…asper and Gust

	modified:   machines/config_machines.xml
	modified:   machines/config_machines.xml
	modified:   machines/config_machines.xml
	modified:   machines/config_machines.xml
load cuda module on Gust for a GPU run

	deleted:    cmake_macros/nvhpc_gust.cmake
	modified:   config_machines.xml
	modified:   machines/config_machines.xml
	modified:   config_batch.xml
	modified:   machines/config_batch.xml
	modified:   machines/config_machines.xml
	modified:   config_machines.xml
	modified:   machines/config_machines.xml
@sjsprecious
Copy link
Copy Markdown
Collaborator Author

@jedwards4b will issue a separate PR to introduce these changes.

@sjsprecious sjsprecious requested a review from jedwards4b August 8, 2023 16:33
@sjsprecious
Copy link
Copy Markdown
Collaborator Author

I updated the nvhpc compiler version to 23.5 with a newer version of ESMF8.6.0b03. The GPU run finished successfully on Derecho.

@jedwards4b jedwards4b merged commit fe65a9c into ESMCI:main Aug 9, 2023
@sjsprecious sjsprecious deleted the add_gpu_gust branch August 9, 2023 19:24
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

enhancement New feature or request

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants