-
Notifications
You must be signed in to change notification settings - Fork 776
{lib}[gfbf/2024a] nvidia-cutlass v3.8.0.0 w/ CUDA 12.6.0 #23606
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
boegel
merged 10 commits into
easybuilders:develop
from
Flamefire:20250808105302_new_pr_nvidia-cutlass3800
Dec 19, 2025
Merged
Changes from all commits
Commits
Show all changes
10 commits
Select commit
Hold shift + click to select a range
5c35bec
adding easyconfigs: nvidia-cutlass-3.8.0.0-gfbf-2024a-CUDA-12.6.0.eb
Flamefire b2ed434
Add treelib dependency
Flamefire dbacb3d
Include treelib in cutlass
Flamefire 0ccffbe
Fix Byteswarning
Flamefire a6d78b0
Merge branch 'easybuilders:develop' into 20250808105302_new_pr_nvidia…
Flamefire b80c2ba
fix minor code style issue in nvidia-cutlass easyconfig
boegel 987a9c4
Change homepage URL for nvidia-cutlass
Flamefire 109b446
Move sanity_check_commands to top-level
Flamefire 0ace2aa
Merge branch 'easybuilders:develop' into 20250808105302_new_pr_nvidia…
Flamefire 93c5a08
don't set enhance_sanity_check in nvidia-cutlass easyconfgi
boegel File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Some comments aren't visible on the classic Files Changed page.
There are no files selected for viewing
66 changes: 66 additions & 0 deletions
66
easybuild/easyconfigs/n/nvidia-cutlass/nvidia-cutlass-3.8.0.0-gfbf-2024a-CUDA-12.6.0.eb
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,66 @@ | ||
| easyblock = 'PythonBundle' | ||
|
|
||
| name = 'nvidia-cutlass' | ||
| version = '3.8.0.0' | ||
| versionsuffix = '-CUDA-%(cudaver)s' | ||
|
|
||
| homepage = 'https://pypi.org/project/nvidia-cutlass' | ||
| description = """ | ||
| CUTLASS is a collection of CUDA C++ template abstractions for implementing high-performance | ||
| matrix-matrix multiplication (GEMM) and related computations at all levels and scales within CUDA. | ||
| It incorporates strategies for hierarchical decomposition and data movement similar to those used | ||
| to implement cuBLAS and cuDNN. | ||
| CUTLASS decomposes these "moving parts" into reusable, modular software components abstracted by C++ template classes. | ||
| Primitives for different levels of a conceptual parallelization hierarchy can be specialized and tuned | ||
| via custom tiling sizes, data types, and other algorithmic policy. | ||
| The resulting flexibility simplifies their use as building blocks within custom kernels and applications. | ||
| """ | ||
|
|
||
| toolchain = {'name': 'gfbf', 'version': '2024a'} | ||
|
|
||
| builddependencies = [ | ||
| ('poetry', '1.8.3'), | ||
| ] | ||
|
|
||
| dependencies = [ | ||
| ('CUDA', '12.6.0', '', SYSTEM), | ||
| ('CUDA-Python', '12.6.2.post1', versionsuffix), | ||
| ('Python', '3.12.3'), | ||
| ('Python-bundle-PyPI', '2024.06'), | ||
| ('SciPy-bundle', '2024.05'), | ||
| ('networkx', '3.4.2'), | ||
| ('pydot', '3.0.3'), | ||
| ] | ||
|
|
||
| exts_list = [ | ||
| ('treelib', '1.8.0', { | ||
| 'sources': [SOURCE_TAR_GZ], | ||
| 'checksums': ['e1be2c6b66ffbfae85079fc4c76fb4909946d01d915ee29ff6795de53aed5d55'], | ||
| }), | ||
| (name, version, { | ||
| 'source_tmpl': 'nvidia_cutlass-%(version)s-py3-none-any.whl', | ||
| 'post_install_patches': [{ | ||
| 'name': 'nvidia-cutlass-3.8.0.0_fix-BytesWarning.patch', | ||
| 'sourcepath': 'lib/python%(pyshortver)s/site-packages/cutlass', | ||
| 'level': 3, | ||
| }], | ||
| 'checksums': [ | ||
| '013147221a63500205da233ae02e6262463917f3fe39cb09efbca37bfd1c39f9', | ||
| {'nvidia-cutlass-3.8.0.0_fix-BytesWarning.patch': | ||
| '63eb47894340c0ea03d0d2faaa49c1979915f903b5bc2ced17f8e0dd5ab854ed'}, | ||
| ], | ||
| 'modulename': 'cutlass', | ||
| }), | ||
| ] | ||
|
|
||
| sanity_check_commands = [ | ||
| 'python -sc "import cutlass_library"', | ||
| 'python -bb -sc "' + '; '.join(( | ||
| 'import cutlass', | ||
| # These serves as a smoke test, e.g. nvcc_version() was incompatible with -bb | ||
| "assert cutlass.nvcc_version().startswith('%(cudamajver)s')", | ||
| "assert cutlass.cuda_install_path() == '$EBROOTCUDA'", | ||
| )) + '"', | ||
| ] | ||
|
|
||
| moduleclass = 'lib' | ||
39 changes: 39 additions & 0 deletions
39
easybuild/easyconfigs/n/nvidia-cutlass/nvidia-cutlass-3.8.0.0_fix-BytesWarning.patch
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,39 @@ | ||
| When neither `text` nor `encoding` is specified to `subprocess.run` then `stdout` will be of type `bytes`. | ||
| A subsequent `str(stdout)` causes a `BytesWarning` which might result in errors, e.g. if `python -bb` is used. | ||
| See https://github.com/NVIDIA/cutlass/pull/2682 | ||
|
|
||
| Fixes hard failures in PyTorch tests that do use `-bb`. | ||
|
|
||
| Author: Alexander Grund (TU Dresden) | ||
|
|
||
| diff --git a/python/cutlass_cppgen/__init__.py b/python/cutlass_cppgen/__init__.py | ||
| index 9bdd259c02..0e28ff55fd 100644 | ||
| --- a/python/cutlass_cppgen/__init__.py | ||
| +++ b/python/cutlass_cppgen/__init__.py | ||
| @@ -39,11 +39,11 @@ | ||
| def _cuda_install_path_from_nvcc() -> str: | ||
| import subprocess | ||
| # Attempt to detect CUDA_INSTALL_PATH based on location of NVCC | ||
| - result = subprocess.run(['/usr/bin/which', 'nvcc'], capture_output=True) | ||
| + result = subprocess.run(['/usr/bin/which', 'nvcc'], capture_output=True, text=True) | ||
| if result.returncode != 0: | ||
| raise Exception(f'Unable to find nvcc via `which` utility.') | ||
|
|
||
| - cuda_install_path = result.stdout.decode('utf-8').split('/bin/nvcc')[0] | ||
| + cuda_install_path = result.stdout.split('/bin/nvcc')[0] | ||
| if not os.path.isdir(cuda_install_path): | ||
| raise Exception(f'Environment variable "CUDA_INSTALL_PATH" is not defined, ' | ||
| f'and default path of {cuda_install_path} does not exist.') | ||
| @@ -63,10 +63,10 @@ def nvcc_version(): | ||
| import subprocess | ||
|
|
||
| # Attempt to get NVCC version | ||
| - result = subprocess.run(['nvcc', '--version'], capture_output=True) | ||
| + result = subprocess.run(['nvcc', '--version'], capture_output=True, text=True) | ||
| if result.returncode != 0: | ||
| raise Exception('Unable to run `nvcc --version') | ||
| - _NVCC_VERSION = str(result.stdout).split(" release ")[-1].split(",")[0] | ||
| + _NVCC_VERSION = result.stdout.split(" release ")[-1].split(",")[0] | ||
| return _NVCC_VERSION | ||
|
|
||
| _CUDA_INSTALL_PATH = None |
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.