Skip to content

Conversation

@JackWeiw
Copy link
Contributor

In driver_api.cc,

// MergeSharedMemoryAllocations must be applied after SplitHostDevice
 // because the merged allocation site is at the beginning of each device function

howerver MergeSharedMemoryAllocations didn't take care of call op ptx_cp_async,
result in error
script to reproduce
image
as we can see it didn't successfully merge shared memory in ptx_cp_async
image

@JackWeiw
Copy link
Contributor Author

CC @vinx13

@vinx13 vinx13 merged commit 78ba385 into apache:main Mar 30, 2024
thaisacs pushed a commit to thaisacs/tvm that referenced this pull request Apr 3, 2024
* [BugTIR]fix error merging shared memory for ptx_cp_async

* run black format

* fix get dtype of ptx_cp_async

* get correct offset of ptx_cp_async

* black format
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants