Conversation
f0a50a6 to
e3a9e4b
Compare
|
I think we should have an operation in the TritonGEN dialect for WaveAll and WaveCluster. I discussed the design of that operation yesterday with @whitneywhtsang. I'm hoping she can create that operation and then we generate it in Triton. |
It will likely also be helpful to add the WavePrefix intrinsic for scans (going by the small bit of documentation). |
I will create a new PR based on the TritonGen operation. |
I created an issue to track this new requirements. |
@whitneywhtsang @etiotto add an op in tritongen for attention, FYI. |
I am working on #1214, will put you as reviewer once it is ready, so you can check if it misses any features you need for your case. |
great, this is the design align to our current. |
dc58c5d to
4fed409
Compare
|
The operator is added in TritonGEN dialect, please update the PR to use it. |
4fed409 to
3d0a0b9
Compare
Use the GenISA WaveAll and WaveCluster in warp horizontal reduce.