-
Couldn't load subscription status.
- Fork 258
[ROCm] CMake and lit changes to enable AMDGPU support. #57
Conversation
|
All (the pull request submitter and all commit authors) CLAs are signed, but one or more commits were authored or co-authored by someone other than the pull request submitter. We need to confirm that all authors are ok with their commits being contributed to this project. Please have them confirm that by leaving a comment that contains only Note to project maintainer: There may be cases where the author cannot leave a comment, or the comment is not properly detected as consent. In those cases, you can manually confirm consent of the commit author(s), and set the ℹ️ Googlers: Go here for more info. |
4b7e965 to
d53b8d9
Compare
|
CLAs look good, thanks! ℹ️ Googlers: Go here for more info. |
|
Hi, Thanks for the contribution! It is nice to see another GPU target :) What is the status of this pull-request right now? Are you seeking early feedback or are you looking for a quick path to integrate this upstream? |
|
@joker-eph Thanks for your prompt reply. All tests are passing on this PR, which means we can run MLIR core end-to-end on AMD ROCm platform. 4 unsupported tests are those on CUDA platform. We are starting building internal tools based on work derived from this PR, so we'd like to understand the steps to get the PR revised and merged. Granted there are quite a few duplication between I'll remove my change to |
338e28e to
7534424
Compare
Fantastic!
Right, I suspect we will want to integrate this piece by piece, in PRs as small as possible. I believe it would be best to start with an RFC on the public mailing-list to discuss this and involve the folks actively working on GPU targets already (SPIR-V and Cuda). Thanks! |
|
@joker-eph Thanks. I'll revise this PR and break it into smaller patches, probably tablegen for |
76dc57f to
0ca81f7
Compare
|
@joker-eph I've reduced this PR to a minimal size. Will file subsequent ones moving forward, plus submitting RFCs to the public mailing list. |
|
Thanks! From what I see in the diff now, these are mostly only build configuration changes left? I expect these to land with the actual runner in one of the later PRs. |
|
@joker-eph Yeah, come to think of it there is really no meat in this PR now. Let me keep this PR open for awhile, and use this PR as a placeholder for |
|
Closing for now (cleanup), feel free to re-open as needed though! |
GPUToROCDLto map thread/block/grid device functions to corresponding intrinsics and device functions on AMD GPU.GPUToROCmto drive MLIR->LLVM->HSA code object lowering process. Some passes are pretty similar with those inGPUToCUDAand may deserve to be merged.mlir-rocm-runnerin mlir/tools.Authors: