[Demo] Wire ROCm Python package tests through ci.yml workflow paths#3182
Closed
ScottTodd wants to merge 17 commits into
Closed
[Demo] Wire ROCm Python package tests through ci.yml workflow paths#3182ScottTodd wants to merge 17 commits into
ScottTodd wants to merge 17 commits into
Conversation
Add script to upload Python packages to S3 or a local directory. This is the first step toward CI testing of ROCm Python packages (issue #1559). Features: - Uploads wheels and sdists to S3 using retrieve_bucket_info() for bucket selection - Supports --output-dir for local testing without S3 - Supports --bucket override for testing with therock-artifacts-testing - Supports --dry-run to preview operations - Adds package list to GitHub job summary Not yet implemented: - Pip index generation (piprepo integration) 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>
Replace piprepo (external dependency) with the existing third-party/indexer/indexer.py to generate a flat index.html file. This works with pip's --find-links option and doesn't require S3 static website hosting to be enabled. Changes: - Replace run_piprepo_build() with generate_index() using indexer.py - Update docstring to show --find-links usage instead of --index-url - Simplify find_package_files() to look for index.html instead of **/*.html 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>
Integrate the Windows workflow with the upload and test infrastructure: - Add upload step using upload_python_packages.py to push packages to S3 - Add test_rocm_wheels job to run GPU tests after upload - Add generate_target_to_run job to determine Windows test runner - Add repository/ref inputs for workflow_call compatibility - Remove inline sanity check (testing happens on GPU machines) - Remove piprepo dependency (upload script uses indexer.py) This matches the pattern established in the Linux workflow, completing the Windows side of the Python package CI pipeline. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>
This was referenced Jan 30, 2026
Member
Author
|
CI results for this got us some interesting data:
The job summaries are working as expected: https://github.com/ROCm/TheRock/actions/runs/21533668625?pr=3182 I'd like to aggregate those summaries with a single index.html for each workflow run and artifact group too. |
Member
Great progress and nice work! I clearly see the benefits for a single index.html summary but let’s try to add this as a Lambda an not integrate it into a workflow please :) |
This was referenced Feb 2, 2026
ScottTodd
added a commit
that referenced
this pull request
Feb 3, 2026
## Motivation Progress on #1559. When adding Python package tests to CI workflows on #3182 we noticed that this job hit a 6 hour timeout on the `linux-strix-halo-gpu-rocm-5` runner: https://github.com/ROCm/TheRock/actions/runs/21533668625/job/62060067637?pr=3182. ## Technical Details I chose a 30 minute timeout for the overall job, to catch slow network issues and then a 5 minute timeout for just the test step, to give enough time for the packages to initialize and then catch any hung subprocesses. Note that we currently skip tests on Linux gfx1151 here: * https://github.com/ROCm/TheRock/blob/55fa089eecd5f20d13357b82b74b5572dda1d67b/build_tools/github_actions/amdgpu_family_matrix.py#L77-L87 * https://github.com/ROCm/TheRock/blob/55fa089eecd5f20d13357b82b74b5572dda1d67b/build_tools/print_driver_gpu_info.py#L24-L26 We may want to similarly filter Linux gfx1151 from parts of `rocm-sdk test`. ## Test Plan and Results Triggered a test run with ROCm version `7.12.0a20260202`: https://github.com/ROCm/TheRock/actions/runs/21644961000 All tests passed in 1 minute, though the `linux-strix-halo-gpu-rocm-7` runner was used for those runs. ## Submission Checklist - [x] Look over the contributing guidelines at https://github.com/ROCm/ROCm/blob/develop/CONTRIBUTING.md#pull-requests. Co-authored-by: Claude <noreply@anthropic.com>
This was referenced Feb 4, 2026
ScottTodd
added a commit
that referenced
this pull request
Feb 4, 2026
## Motivation Progress on #1559. The [`build_tools/linux_portable_build.py`](https://github.com/ROCm/TheRock/blob/main/build_tools/linux_portable_build.py) script is an unnecessary layer of indirection in a github actions context that makes our Windows and Linux workflows look quite different. Future work will also use either the AWS CLI (installed in the build container already) or boto3 to upload the Python packages that were built. ## Technical Details I've also included some related workflow cleanups in this PR to better align the Windows and Linux workflows. We should also be able to simplify [`.github/workflows/release_portable_linux_packages.yml`](https://github.com/ROCm/TheRock/blob/main/.github/workflows/release_portable_linux_packages.yml) (on its own, or by calling this workflow) and then either delete that script or keep it relevant only in documentation like https://github.com/ROCm/TheRock/blob/main/docs/environment_setup_guide.md#reference-build-environments (there may be better options like Dev Containers that wouldn't involve a custom script) One reason to keep the script instead of running the entire job under the container is this "prefetch" code that tries to run multiple async steps in parallel: https://github.com/ROCm/TheRock/blob/c2d46f9f6ede1c6e58255cda08c193f0a3714e12/.github/workflows/release_portable_linux_packages.yml#L212-L216 With GitHub Actions, the "Initialize containers" step does block all workflow progress until complete (this takes [about 1 minute](https://github.com/ROCm/TheRock/actions/runs/21610738922/job/62278660439)). A better solution there could be to preload the docker container on our build VM runners. ## Test Plan * Tested together with other changes as part of #3182 * Test run using artifacts from a prior workflow run: https://github.com/ROCm/TheRock/actions/runs/21610738922 * CI on this PR, e.g. https://github.com/ROCm/TheRock/actions/runs/21611321350/job/62286595392?pr=3214 ## Submission Checklist - [x] Look over the contributing guidelines at https://github.com/ROCm/ROCm/blob/develop/CONTRIBUTING.md#pull-requests. --------- Co-authored-by: Claude <noreply@anthropic.com>
1 task
ScottTodd
added a commit
that referenced
this pull request
Feb 5, 2026
## Motivation Progress on #1559. I'm splitting this off from #3182. In #3136, I'm using [`indexer.py`](https://github.com/ROCm/TheRock/blob/main/third-party/indexer/indexer.py) to generate an index page for a CI workflow run that can be installed from using `--find-links` (and _not_ `--index-url`). For example: https://therock-artifacts-testing.s3.amazonaws.com/21440027240-windows/python/gfx110X-all/index.html. For workflows and developers to be able to conveniently create Python venvs and install those packages, this setup_venv.py script needed support for `--find-links` too. ## Technical Details Key points about the refactoring: 1. Changed scraping signature / behavior to returns the union of all subdirs instead of a distinct list of subdirs per package index. If someone tries to install a package from an index that doesn't have that package, that will be a later error. * before: `scrape_subdirs() -> dict[str, set[str]] | set[str] | None` * after: `_scrape_rocm_index_subdirs() -> set[str] | None` 2. Reworked branching between `pip` and `uv` to make it more clear what is being used where and why 3. Streamlined `run()` to emphasize the `create_venv` -> `update_venv` -> `install_packages_into_venv` flow 4. Allowed setting `--packages` without `--index-url` or `--index-name` (to install from the default pypi index) 5. Added new `--find-links` option (can be used together with `--index-url` as they are compatible/complimentary) Documentation for reference: * https://pip.pypa.io/en/stable/cli/pip_install/ * https://docs.astral.sh/uv/pip/packages/ ## Test Plan * CI usage in some existing workflows * New unit tests for helper functions ## Test Result Manual testing: <details><summary>With `--find-links` and pip</summary> <p> ``` D:\scratch\therock λ python D:\projects\TheRock\build_tools/setup_venv.py test.venv --packages rocm[libraries]==7.12.0.dev0 --find-links=https://therock-artifacts-testing.s3.amazonaws.com/21440027240-windows/python/gfx110X-all/index.html --clean Clearing existing venv_dir 'test.venv' Creating venv at 'test.venv' Dir relative to CWD: 'test.venv' Dir fully resolved : 'D:\scratch\therock\test.venv' ++ Exec [D:\scratch\therock]$ 'C:\Users\Nod-Shark16\AppData\Local\Programs\Python\Python313\python.exe' -m venv 'D:\scratch\therock\test.venv' ++ Exec [D:\scratch\therock]$ 'test.venv\Scripts\python.exe' -m pip install --upgrade pip Requirement already satisfied: pip in d:\scratch\therock\test.venv\lib\site-packages (25.1.1) Collecting pip Using cached pip-26.0-py3-none-any.whl.metadata (4.7 kB) Using cached pip-26.0-py3-none-any.whl (1.8 MB) Installing collected packages: pip Attempting uninstall: pip Found existing installation: pip 25.1.1 Uninstalling pip-25.1.1: Successfully uninstalled pip-25.1.1 Successfully installed pip-26.0 ++ Exec [D:\scratch\therock]$ 'test.venv\Scripts\python.exe' -m pip install --find-links=https://therock-artifacts-testing.s3.amazonaws.com/21440027240-windows/python/gfx110X-all/index.html 'rocm[libraries]==7.12.0.dev0' Looking in links: https://therock-artifacts-testing.s3.amazonaws.com/21440027240-windows/python/gfx110X-all/index.html Collecting rocm==7.12.0.dev0 (from rocm[libraries]==7.12.0.dev0) Using cached rocm-7.12.0.dev0-py3-none-any.whl Collecting rocm-sdk-core==7.12.0.dev0 (from rocm==7.12.0.dev0->rocm[libraries]==7.12.0.dev0) Using cached https://therock-artifacts-testing.s3.amazonaws.com/21440027240-windows/python/gfx110X-all/rocm_sdk_core-7.12.0.dev0-py3-none-win_amd64.whl (654.1 MB) Collecting rocm-sdk-libraries-gfx110X-all==7.12.0.dev0 (from rocm[libraries]==7.12.0.dev0) Using cached https://therock-artifacts-testing.s3.amazonaws.com/21440027240-windows/python/gfx110X-all/rocm_sdk_libraries_gfx110x_all-7.12.0.dev0-py3-none-win_amd64.whl (224.4 MB) Installing collected packages: rocm-sdk-libraries-gfx110X-all, rocm-sdk-core, rocm Successfully installed rocm-7.12.0.dev0 rocm-sdk-core-7.12.0.dev0 rocm-sdk-libraries-gfx110X-all-7.12.0.dev0 Setup complete at 'test.venv'! Activate the venv with: test.venv\Scripts\activate.bat ``` </p> </details> <details><summary>With `--find-links` and uv</summary> <p> ``` λ python D:\projects\TheRock\build_tools/setup_venv.py test_uv.venv --packages rocm[libraries]==7.12.0.dev0 --find-links=https://therock-artifacts-testing.s3.amazonaws.com/21440027240-windows/python/gfx110X-all/index.html --clean --use-uv --pre Clearing existing venv_dir 'test_uv.venv' Creating venv at 'test_uv.venv' Dir relative to CWD: 'test_uv.venv' Dir fully resolved : 'D:\scratch\therock\test_uv.venv' ++ Exec [D:\scratch\therock]$ uv venv 'D:\scratch\therock\test_uv.venv' Using CPython 3.13.5 interpreter at: C:\Users\Nod-Shark16\AppData\Local\Programs\Python\Python313\python.exe Creating virtual environment at: test_uv.venv Activate with: test_uv.venv\Scripts\activate ++ Exec [D:\scratch\therock]$ uv pip install --python 'test_uv.venv\Scripts\python.exe' --find-links=https://therock-artifacts-testing.s3.amazonaws.com/21440027240-windows/python/gfx110X-all/index.html --prerelease=allow 'rocm[libraries]==7.12.0.dev0' Using Python 3.13.5 environment at: test_uv.venv Resolved 3 packages in 162ms ░░░░░░░░░░░░░░░░░░░░ [0/3] Installing wheels... warning: Failed to hardlink files; falling back to full copy. This may lead to degraded performance. If the cache and target directories are on different filesystems, hardlinking may not be supported. If this is intentional, set `export UV_LINK_MODE=copy` or use `--link-mode=copy` to suppress this warning. Installed 3 packages in 1.09s + rocm==7.12.0.dev0 + rocm-sdk-core==7.12.0.dev0 + rocm-sdk-libraries-gfx110x-all==7.12.0.dev0 Setup complete at 'test_uv.venv'! Activate the venv with: test_uv.venv\Scripts\activate.bat ``` </p> </details> ## Submission Checklist - [x] Look over the contributing guidelines at https://github.com/ROCm/ROCm/blob/develop/CONTRIBUTING.md#pull-requests. --------- Co-authored-by: Claude <noreply@anthropic.com>
Member
Author
|
Superseded by #3261 |
jayhawk-commits
pushed a commit
that referenced
this pull request
Mar 11, 2026
## Motivation Bump rocm-systems from 93bc019 to 093b66c (includes fix for hip-tests issue and revert for mathlib hiprtc issues and revert for rccl-test, added revert for miopen failures due to PR 653): Commits: 093b66c (HEAD, origin/develop, origin/HEAD) Revert "SWDEV-546177 - hipModuleGetLoadingMode API impl (#653)" (#3858) d8a0adb [AMD-SMI] Hide libamd_smi.so internal symbols (#3777) d4da458 [rocprofiler-sdk] [Documentation ] Updating changelog (#3827) 19fadeb (origin/users/abchoudh/fix_dispatch_count) [RCCL][Tuner Plugin] Enable tuning of RCCL tuning constants (#3757) b4f5f8a rocr: Fix IPC dmabuf hang with large allocations (#3211) 64efea0 RCCL: allow users to override max and per job memory & fix defaults. (#3797) 9b3dd10 Removing ready_for_review (#3849) 7e43880 [rocprofiler-systems] Update ROCm version to 7.2.0 in CI workflows for Debian, RedHat, and Ubuntu (#3431) 1fdb6b9 [rocshmem] add gda/topology unit tests (#3715) be1ea24 Move hipMipmappedArrayGetMemoryRequirements test to common tests e4513f0 Update amdgpu-windows-interop with latest changes, pal 58aa0bab2ced0cc9ebe8d2d0932db6774feb4e49 2026-03-04(#3773) b1f964d [rocprofiler-compute] Ensure long kernel name fully shows in compute analyze (#3665) 4dcf1e3 SWDEV-567112 - Replace test names (#3787) 33f5f30 ROCM-2428 - fixes hipStreamBatchMemOp invalid operation checks (#3099) 139f4bf [SWDEV-556456] Align HIP_UUID with rocminfo (#3614) 8e89285 Reduce buffers alignment to 4 bytes (#3821) 51be29a AIRUNTIME-125: Consolidate Windows optimization and debug flags (#3825) 1407392 [AMD-SMI] CI: Fix root workflow to use ASIC-specific test filters (#3807) 63f78a9 (origin/users/mcao/fix_rocpdsummary) [ROCM-SMI] Fix DRM include dirs leaking absolute build paths to consumers (#3808) caf2f7e [ROCM-186] amd-smi: Add support for a VRAM and GTT tuning interface (#3636) a0712d4 [TheRock CI] Update projects_to_test lists (#3749) 02090c4 rocrtst: install gfx .hsaco files to share/rocrtst (#3744) 4a0a1cb Merge other simd table (#3696) 0d07657 Add missing kwargs from rocprofiler_add_integration_validate_test in .cmake-format.yaml (#2336) 3a3df30 Optimize device counting service GPU interactions (#1583) 95d9da0 Add SPM Enable flag in build infrastructure (#3677) 12bb943 [rocprofiler-sdk] On-demand GPU profile queue creation/destruction (#3586) 941057c Navi4 tuning table iter 1 (#3052) dbf2b73 [AMD-SMI] Display N/A for cu_occupancy when file is unavailable (#3589) b0efc7c [RCCL] [UT] Add ROCTX test (#3625) ba7a20e Reducing the p2pnChannels for half-subscription A2A on multi-node MI350 (#3381) 75238c9 [clr] Fix memory leak in getOrCreateHostcallBuffer (#3699) af2ee0e [hip-tests] ASAN Check for image support before we create context (#3834) ad44966 Update windows ci subtree in include amdgpu-windows-interop (#3814) c8ad252 [rocprofiler-register] Fix compilation with system fmt/glog (#1243) 7818815 Update README to include dbgapi and debug agent components (#3731) 88e4a78 ROCProfiler and ROCTracer: Modifying deprecation note (#3831) b5918a5 [ROCM-3124-3125-3126] CUID file generation hangs on MI350 systems/CUID test failures/Segmentation fault in CUID example code (#3548) 97a5dd9 Update copyright to use SPDX IDs (#3805) 511730a [rocshmem]: add flood-amo tester (#3653) 2d650a0 [clr] Fix heap use after free error in device allocations (#3789) b6b179a Disable hipHostRegister_Negative test for ASAN (#3832) 39ec318 [RCCL] Add GDA alltoallv via rocshmem integration (#3613) fb0f4d5 [RCCL] [CUMEM] Fix cuMem multi-process runs (#3811) c3de7d4 SWDEV-526201 - Fix and enable disabled HIP tests from warp group (#3089) 8d9a8ca roofline: code cleanup and refactor vector types (#3813) 8957e49 Don't wait on command completion if worker thread is destroyed (#3790) 9e7586a [rocshmem] Add barrier APIs and expose `ROCSHMEM_TEAM_WORLD` on device (#3651) 91b0923 Revert "fix local gpu release static build failure (#3667)" (#3799) 0fda754 libhsakmt: Add secondary KFD context creation support ee43db9 Revert "Update TheRock reference to 20260303 commit (#3709)" (#3826) 86e28b9 Added fix to update GL2C counters instance count for GFX11.5 (#3100) 93f69f7 Adjust includes to match use (#3742) e9fbc3f (develop) Update TheRock reference to 20260303 commit (#3709) be0675a (HEAD) Revert "Support fp8 types in hiprtc (#2605)" (#3792) 3e3a94a [rocprofiler-systems] Add trace_cache support for std::optional<T> serialization (#3490) 0b42a7f clr: Eliminate unnecessary kernel name string copies (#3774) b6b0d77 rocr: Add hsa_amd_memory_async_batch_copy API for batched memory copies (#3259) 486e6d1 Resolve staircase RS regression with 48 max channels (#3684) eb59c85 [gfx942][gfx950] Leverage new cache bypass builtins for simple protocol where available (#2847) 4d74d27 (origin/users/raramakr/rocm-smi-target) Revert "Auto Labeler: Add ci:regression-detection label to rccl PRs (#3543)" (#3769) 8f07955 [AMD-SMI] CI: Use ASIC-specific test blacklists in workflows (#3775) 7cef5b6 Fix MFMA total FLOPS calculation (#3371) aea3751 Remove duplicated tests (#3235) b6c656f Remove duplicated tests in memory module (#3087) ca3137d [rocprofiler-sdk] Install integration tests without building for therock & Misc. fixes (#3047) 0ab5c41 [rdc] Enable on-demand queue mode in rocprofiler-sdk to prevent inference degradation (#3629) a1eb2a1 rocr/wsl: a library should not output to std::out by default (#3718) b7da296 Reenable flood_put/get testers on mlx5 since they should work after pr2732 (#3748) 000e24d [rocprofiler-sdk] Add automatic late-start support to rocprofiler_force_configure (#2168) 64ea87f [hip-tests] Fix memory leaks in hipMemPoolTrimTo tests (#3643) 543a7d7 rocr: Include code object allocs in lightweight coredump a58da37 [rocdecode] - update rocdecode ctest (#3768) f88e4ee [rocprofiler-systems] Make CDash submit non-fatal and add GitHub Actions logging (#3525) cb14deb [rocprofiler-systems] Update nlohmann-json submodule (#3391) 4492530 SWDEV-567112 - Introduce new mechanism for tagging and disabling tests - Part 2 (#3707) 8ca9913 disabling rccl from full build (linux), covered in RCCL CI (#3770) c4fdb20 [ATT] Re-enable tests. Add option to specify perf to target CU only (#2819) 615aab9 ROCM-3816 Out of Memory fix (#3588) 8ffad41 Fix rocm_smi64 exporting invalid absolute paths to consumers (#3717) 042d76a rocr: Remove dependency on KFD in Runtime::VMemoryHandleMap (#2515) 555db59 [AMD-SMI] CPU: Added support for family 1A Models 50h-57h (#3206) 3affa2c [SWDEV-555935] Fix shared mutex and self-heal (#3729) ba0bf0f Replace hipMemGetInfo with ihipMemGetInfo and use it for internal calls. (#2845) c5cef9b Fix HIP_RETURN on all HIP API calls. (#2838) 241ce7b Revert "memory: fix "contiguous_bytes" calculation in generic conversion (#3285)" (#3755) 8a690f4 [kpack/clr] Windows PE/COFF support for kpack artifact splitting and runtime loading (#3728) 863bdf8 MFMA pre-processor guards for ipc.hip (#3724) 90bb9b1 Release queue outside of vgpusAccess lock (#3705) de45239 clr: Add build support of ROCR and PAL backends together (#3722) dfb7abc [rocprofiler-sdk] RCCL API changes for RCCL_API_TRACE_VERSION_PATCH = 3 (#3477) d69d4f2 [AICOMRCCL-633] - Fixed warnings in tests (#3402) 067d86d rocr/wsl: Disable AQL Queue usage with flag ROCR_USE_PM4 (#3663) 594eb60 [TheRock CI] rocm-systems build full ROCm stack (#3182) 27d17e8 [ROCProfiler-SDK] Fix SWDEV-556922: Handle comments before checking for pmc: (#1723) c80d904 memory: fix "contiguous_bytes" calculation in generic conversion (#3285) 669987c [hip-tests] ASAN - add missing release handles (#3735) a24bbd7 fix local gpu release static build failure (#3667) 259b2ff Speed up DeviceId (#2803) 65d9264 Simplify MPI trace merge logic and remove legacy guards (#3562) 1076c08 use system to look for zcat path instead (#3720) 22f1d19 [AICOMRCCL-355] Enable threshold-based p2p-batching (#3000) a2e4c79 Partially flatten template tests cases (#2597) e242abe Pass space separated gfx target list to RCCL build command (#3701) 4f78aea SWDEV-570074 - Refactor Memset memory object handling. (#2228) b3ad12d Support Nvidia build on theRock for HIP-tests (#3335) a1cf15e Support fp8 types in hiprtc (#2605) 8ef84b0 [rocprofiler-systems] Add HPC examples to automated testing (#3437) db3a70d Free memory which was allocated in tests (#3710) 27e6809 [rocprofiler-systems]: Fix rhel CI failure on for MPI and UCX tests (#3700) 0d9aaf5 rccl/topo_expl: fix build issue. (#3719) be04d75 Fix zcat path used for checking kernel configs (#3423) cab60a7 rocr/thunk/win: Add CU mask support (#3518) 5b3d826 [CUMEM] Initial support for cuMem APIs (#2763) 0606ff4 [HIP] [PLAT-194496] Improve Stress_hipMalloc_HighSizeAlloc reliability (#3550) 05750a7 fix hip-test name in config (#3716) 33f777f hsakmt: Remove --high functionality from run_kfdtest.sh (#2486) e4c46e3 Hide the retain under direct dispatch check (#3698) bfe0ca0 Add rocprof trace decoder to CI tests (#3690) a769b6f [rocSHMEM] Edgar/abstract allocator ipc part1 (#3411) 659fb52 [AMD-SMI] Fix bugs, improve error handling, and clean up NIC/switch code (#3654) 0eb26ea hsakmt: Fix Import/Export of dmabuf_fd for WSL/Windows (#3348) a122936 [SWDEV-567812] Add UBB power and power_limit fields to npm_info (#3262) c3bec09 [rocprofiler-sdk][rocprofv3][rocpd] Updates for KFD data (#340) 7c44d47 SWDEV-547659 - Remove HIP_VERSION_GITHASH in logs (#448) 74b6487 SWDEV-547008 - Documentation fix for function return values (#463) af21cd4 SWDEV-545553 - Improve clarity and robustness of CALLBACK unit tests (#546) 180d639 SWDEV-544900 - Change hip-test test case name (#547) feeca99 Doc improvements (#3688) c1822b6 ROCprofiler-SDK: deprecation of legacy tools (#3609) 5d7aff8 Fix rocprof-compute-viewer link (#3459) 0b0b484 AIRUNTIME-129 - Fix Ocl test failures of 2D image with pitches. (#3584) ac569b8 Fix memory tests config (#3687) 603fe7a [hip-tests] Enable hipMipmappedArrayGetMemoryRequirements test via cmake 4fad445 [hip] Docs: Updates to some memory management pages 8cc5955 AICOMRCCL-656 fix memory leak in ncclCommInitRankFunc (#3628) 94a4595 Fix missing amd_comgr linkage in pc-sampling integration test (#3453) 2a68565 rocrtst: CMAke file: strip xnack/feature suffixes from gfxNum in build_kernel (#3652) c3542bf [rocprofv3] Deprecating input text files for counter collection (#1562) ff122e7 SWDEV-573073 - Cleanup hipHostAlloc/Malloc/Register tests (#3017) 5b1deaf SWDEV-567112 - Introduce new mechanism for tagging and disabling tests - Part 1 - Core (#2351) 6e0cc30 rocrtst: MaxSingleAllocationTest: skip CPU NUMA nodes >0 (#3208) d65f601 [AICOMRCCL-667] rccl: Change GDR selection logic. (#3607) f1c44ab Patch Back to Old Repo: fixes from manual runs (#3621) fe53bcd [AMD-SMI] Allow amdsmi init to succeed when no NIC hardware is present (#3403) b25600e [ROCM SMI] Fix fw pldm version not displayed in default amd-smi (#3594) 169d2ef root to module wiring, remove legacy source collection (#3482) 7469781 [LRT][clr] SWDEV-512963-Fix CTS test failures for 1D buffer copy (#3520) c8f55d9 Adding rocprof trace decoder (#3576) 425e983 Trace decoder codeowners (#3600) a176efd [hip-tests] Add return statements to HIP_SKIP_TEST (#3647) 32687cf rocrtst: CPUAccessToGPUMemoryTest: Cap host allocation to 512 MB under ASAN (#3407) 97c0206 Update codeowners for thunk DXG (#3334) be44b28 [rocdecode][rocjpeg] - ctest CMakeLists cleanup (#3632) 80ff0b8 Various memory leak fixes in hip-tests (#3605) 0988f67 fix typo in help text (#3314) 9f823c5 Fix CUID file lookup by loading files before searching entries (#3436) 064c892 SWDEV-546177 - hipModuleGetLoadingMode API impl (#653) 006213e ROCM-2696: Ignare size and base if null ptr (#3336) 6060b99 Improve atomic min max test perf (#2580) 3fbcc13 Change printf capture impl (#1127) 93bc019 (tag: hip-version_7.12.60610, origin/users/mradosav-amd/rocprofsys-selective-region) [ROCM-CORE] Update rdhc script to support rocm install prefix (ROCm/rocm-systems#3596) [AICOMRCCL-355]: https://amd-hub.atlassian.net/browse/AICOMRCCL-355?atlOrigin=eyJpIjoiNWRkNTljNzYxNjVmNDY3MDlhMDU5Y2ZhYzA5YTRkZjUiLCJwIjoiZ2l0aHViLWNvbS1KU1cifQ
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.

Fixes #1559.
This is a draft PR meant to show the full integration on a pull request. Commits will be split off from this for review:
upload_python_packages.py, similar topost_build_upload.py#3136Test runs on artifacts from prior CI workflow runs (https://github.com/ROCm/TheRock/actions/runs/21486753577 on e9da873):