Skip to content

Conversation

kasparas-k
Copy link
Contributor

The recent change d9082e8 breaks backward pass benchmark, due to all calls of _wmma_implicit_gemm_backward_logic doing tuple unpacking expecting two elements.

Since other non-tensor returns give the index i as the second element, I set the return index to -1, as the return happens before any iterations. If there is a better integer value to return, we can change it.

@chrischoy chrischoy merged commit aea0287 into NVlabs:main Oct 9, 2025
@kasparas-k kasparas-k deleted the wmma_backward_benchmark branch October 9, 2025 18:06
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants