Skip to content

Changing sgpr limits (#304)#514

Merged
mahmoodw merged 1 commit into
release-staging/rocm-rel-7.0from
users/mahmoodw/sgprLimit
Jul 10, 2025
Merged

Changing sgpr limits (#304)#514
mahmoodw merged 1 commit into
release-staging/rocm-rel-7.0from
users/mahmoodw/sgprLimit

Conversation

@mahmoodw
Copy link
Copy Markdown
Contributor

@mahmoodw mahmoodw commented Jul 7, 2025

This includes 2 changes:

  • Unrestricted the temp sgprs needed for gsu from being contiguous, avoiding overflow for certain kernels
  • Account for additional temp sgprs that will be required for code gen, up to physical limits

🔁 Imported from
ROCm/hipBLASLt#2184 🧑‍💻 Originally authored by @mahmoodw


stanleytsang-amd pushed a commit to stanleytsang-amd/rocm-libraries that referenced this pull request Jul 8, 2025
* Structuring backend (ROCm#495)

* split rocsparse_handle.hpp

* split rocsparse_handle.cpp

* structuring trm_info without behaviour change

* fixing test failure related to copy_mat_info

* Fixing default initialization of rocsparse_trm_info.
Removing error prone cast when calling hipMalloc

* Fixing missing behavior check in spgeam (ROCm#514)

* Fixing missing behavior check.

* stage analysis and stage symbolic are not required for stage numeric

* spgeam final fix (ROCm#516)

* spgeam fix

---------

Co-authored-by: Yvan Mokwinski <yvan.mokwinski@gmail.com>
bstefanuk pushed a commit that referenced this pull request Jul 8, 2025
* Fixing missing behavior check.

* stage analysis and stage symbolic are not required for stage numeric
In regards to LWPTENSILE-1696

This includes 2 changes:
- Unrestricted the temp sgprs needed for gsu from being contiguous,
avoiding overflow for certain kernels
- Account for additional temp sgprs that will be required for code gen,
up to physical limits

---
🔁 Imported from
[ROCm/hipBLASLt#2184](ROCm/hipBLASLt#2184)
🧑‍💻 Originally authored by @mahmoodw

---------

Co-authored-by: mahmoodw <wmahmood@amd.com>
Co-authored-by: mahmoodw <44450175+mahmoodw@users.noreply.github.com>
@AlexBrownAMD AlexBrownAMD force-pushed the users/mahmoodw/sgprLimit branch from 18e667f to 62bd644 Compare July 9, 2025 19:27
@mahmoodw mahmoodw merged commit a10446c into release-staging/rocm-rel-7.0 Jul 10, 2025
8 of 9 checks passed
@mahmoodw mahmoodw deleted the users/mahmoodw/sgprLimit branch July 10, 2025 18:54
assistant-librarian Bot pushed a commit to ROCm/hipBLASLt that referenced this pull request Jul 10, 2025
Changing sgpr limits (#304)
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

This includes 2 changes:
- Unrestricted the temp sgprs needed for gsu from being contiguous,
avoiding overflow for certain kernels
- Account for additional temp sgprs that will be required for code gen,
up to physical limits
ammallya pushed a commit that referenced this pull request Jul 14, 2025
* Fixing missing behavior check.

* stage analysis and stage symbolic are not required for stage numeric

[ROCm/rocSPARSEcommit: 151145a]
ammallya pushed a commit that referenced this pull request Jul 14, 2025
* Structuring backend (#495)

* split rocsparse_handle.hpp

* split rocsparse_handle.cpp

* structuring trm_info without behaviour change

* fixing test failure related to copy_mat_info

* Fixing default initialization of rocsparse_trm_info.
Removing error prone cast when calling hipMalloc

* Fixing missing behavior check in spgeam (#514)

* Fixing missing behavior check.

* stage analysis and stage symbolic are not required for stage numeric

* spgeam final fix (#516)

* spgeam fix

---------

Co-authored-by: Yvan Mokwinski <yvan.mokwinski@gmail.com>

[ROCm/rocSPARSEcommit: 22d1de0]
ammallya pushed a commit that referenced this pull request Jul 14, 2025
* Structuring backend (#495)

* split rocsparse_handle.hpp

* split rocsparse_handle.cpp

* structuring trm_info without behaviour change

* fixing test failure related to copy_mat_info

* Fixing default initialization of rocsparse_trm_info.
Removing error prone cast when calling hipMalloc

* Fixing missing behavior check in spgeam (#514)

* Fixing missing behavior check.

* stage analysis and stage symbolic are not required for stage numeric

* spgeam final fix (#516)

* spgeam fix

---------

Co-authored-by: Yvan Mokwinski <yvan.mokwinski@gmail.com>

[ROCm/rocSPARSE commit: 22d1de0]
AlexBrownAMD pushed a commit that referenced this pull request Jul 15, 2025
This includes 2 changes:
- Unrestricted the temp sgprs needed for gsu from being contiguous,
avoiding overflow for certain kernels
- Account for additional temp sgprs that will be required for code gen,
up to physical limits

---
🔁 Imported from
[ROCm/hipBLASLt#2184](ROCm/hipBLASLt#2184) 🧑‍💻
Originally authored by @mahmoodw

---------

Co-authored-by: assistant-librarian[bot] <210906412+assistant-librarian[bot]@users.noreply.github.com>
Co-authored-by: mahmoodw <wmahmood@amd.com>
SathiyarajRam pushed a commit that referenced this pull request Jul 15, 2025
This includes 2 changes:
- Unrestricted the temp sgprs needed for gsu from being contiguous,
avoiding overflow for certain kernels
- Account for additional temp sgprs that will be required for code gen,
up to physical limits

---
🔁 Imported from
[ROCm/hipBLASLt#2184](ROCm/hipBLASLt#2184) 🧑‍💻
Originally authored by @mahmoodw

---------

Co-authored-by: assistant-librarian[bot] <210906412+assistant-librarian[bot]@users.noreply.github.com>
Co-authored-by: mahmoodw <wmahmood@amd.com>
ammallya pushed a commit that referenced this pull request Oct 27, 2025
…s-core-1.18.1

Bump rocm-docs-core from 1.17.1 to 1.18.1 in /docs/sphinx
ammallya pushed a commit that referenced this pull request Oct 28, 2025
…s-core-1.18.1

Bump rocm-docs-core from 1.17.1 to 1.18.1 in /docs/sphinx

[ROCm/rocwmma commit: b0fff32]
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants