2.9.0 osx testing#446
Conversation
Update for 2.9.0, and rebase patches. Signed-off-by: Michał Górny <mgorny@quansight.com>
The tested assertion started failing on AArch64 cross builds, make it print the actual value to aid debugging (now and in the future). Signed-off-by: Michał Górny <mgorny@quansight.com>
Signed-off-by: Michał Górny <mgorny@quansight.com>
Our old patch is no longer sufficient for cross-compilation, the new code checks for PYTORCH_BLAS_USE_CBLAS_DOT envvar though. Signed-off-by: Michał Górny <mgorny@quansight.com>
It looks like we now need to explicitly pass CUDA target include and `libcuda.so` stub directories while building inductor. Introduce a substitution `@CUDA_TARGET@` value in the patch, and replace it with appropriate target in `build.sh`. This leaves the unsubstituted `@CUDA_TARGET@` path on Windows, but that shouldn't do any harm -- the path will simply not exist. Signed-off-by: Michał Górny <mgorny@quansight.com>
Inductor now started requiring fmt headers, and since we are moving includes from site-packages to the top-level include directory, having PyTorch install fmt headers there would conflict with system fmt install. However, PyTorch nowadays uses fmt 12, so let's just use the system library instead. This uses a WIP patch submitted upstream along with a quick hack to make kineto build. Signed-off-by: Michał Górny <mgorny@quansight.com>
Signed-off-by: Michał Górny <mgorny@quansight.com>
Upstream now does not build fbgemm on Windows by default, and if we try to force building it, it just fails. Signed-off-by: Michał Górny <mgorny@quansight.com>
This is required by fmt headers, and upstream is doing it explicitly in PyTorch's `CMakeLists.txt`. I suppose all dependent projects will have to follow suit. Signed-off-by: Michał Górny <mgorny@quansight.com>
Signed-off-by: Michał Górny <mgorny@quansight.com>
Signed-off-by: Michał Górny <mgorny@quansight.com>
Signed-off-by: Michał Górny <mgorny@quansight.com>
Signed-off-by: Michał Górny <mgorny@quansight.com>
…and conda-forge-pinning 2025.11.05.12.33.05 Other tools: - conda-build 25.9.0 - rattler-build 0.49.0 - rattler-build-conda-compat 1.4.9
Signed-off-by: Michał Górny <mgorny@quansight.com>
|
Hi! This is the friendly automated conda-forge-linting service. I wanted to let you know that I linted all conda-recipes in your PR ( Here's what I've got... For recipe/meta.yaml:
For recipe/meta.yaml:
This message was generated by GitHub Actions workflow run https://github.com/conda-forge/conda-forge-webservices/actions/runs/19195452456. Examine the logs at this URL for more detail. |
|
@h-vetinari, disabling FBGEMM didn't seem to change much. I've now also disabled MKLDNN but TBH I don't think this is the right way forward. Hopefully conda-forge/.cirun#118 + conda-forge/.cirun#119 will let us switch runners. |
|
Now, are the runners busy or is there still something wrong with the labels? |
|
(already fixed wrong bugno in the commit message locally) |
Given that this our first foray into the osx runners, I'm assuming something's not yet configured right. CC @aktech |
|
Hi! This is the friendly automated conda-forge-linting service. I just wanted to let you know that I linted all conda-recipes in your PR ( I do have some suggestions for making it better though... For recipe/meta.yaml:
This message was generated by GitHub Actions workflow run https://github.com/conda-forge/conda-forge-webservices/actions/runs/19606522025. Examine the logs at this URL for more detail. |
|
Okay, that's something new. Either insufficient permissions on the runner, or a bad action script? |
Most likely "insufficient permissions", I'll update the macos image. |
Fixes conda-forge#429 Signed-off-by: Michał Górny <mgorny@quansight.com>
…5.11.13.12.38.23 Other tools: - conda-build 25.9.0 - rattler-build 0.49.0 - rattler-build-conda-compat 1.4.9
Signed-off-by: Michał Górny <mgorny@quansight.com>
…5.11.13.12.38.23 Other tools: - conda-build 25.9.0 - rattler-build 0.49.0 - rattler-build-conda-compat 1.4.9
Signed-off-by: Michał Górny <mgorny@quansight.com>
Signed-off-by: Michał Górny <mgorny@quansight.com>
…5.11.14.03.49.41 Other tools: - conda-build 25.9.0 - rattler-build 0.49.0 - rattler-build-conda-compat 1.4.9
|
@h-vetinari, on a semi-related manner, I suppose we want to start doing "megabuilds" here as well after the switch, right? |
|
Yeah, would be great to use megabuilds for osx too! :) |
Signed-off-by: Michał Górny <mgorny@quansight.com>
…5.11.14.11.30.28 Other tools: - conda-build 25.9.0 - rattler-build 0.49.0 - rattler-build-conda-compat 1.4.9
Signed-off-by: Michał Górny <mgorny@quansight.com>
…5.11.14.11.30.28 Other tools: - conda-build 25.9.0 - rattler-build 0.49.0 - rattler-build-conda-compat 1.4.9
Signed-off-by: Michał Górny <mgorny@quansight.com>
…5.11.17.12.05.48 Other tools: - conda-build 25.9.0 - rattler-build 0.49.0 - rattler-build-conda-compat 1.4.9
|
I'm going to try something cursed here, to at least confirm if lack of |
|
FYI: I am in the process of upgrading macos runners server to a more powerful one, which means the runners will not spin up for today, migration should be complete tomorrow. |
|
Okay, thanks for telling me before I started frantically trying to figure out why they aren't starting again xP. |
Signed-off-by: Michał Górny <mgorny@quansight.com>
|
@mgorny @h-vetinari The OSX CI is running now, with the new image. (Also fyi reminder: max concurrency is 2, due to Apple's Virtualisation limit on a single machine) |
|
That's amazing, thanks so much @aktech! The OSX CI didn't only run, it produced the builds and tests passed (except on osx-arm64 where we shouldn't run the tests obviously; possibly I'm surprised though, I had thought that the scaleway machines are osx-arm64 already; clearly they're osx-64... I must have gotten this wrong somewhere, sorry. In any case, this should unblock us ~completely. @mgorny, I suggest we don't touch the current setup to get 2.9 out the door, and take care of switching to megabuilds on osx afterwards. |
Nevermind, I just need to look at the config in this PR: build_platform:
osx_64: osx_arm64
provider:
osx_arm64: github_actionsOTOH, that means that our (now native) osx-arm64 builds aren't passing the tests 😑 |
74735f4 to
59a5c1f
Compare
|
OK, with the fixes from #448, the test suite is looking pretty good, except the following two failures: The "unexpected success" is from a highly templated test and AFAICT the failure expectation comes from here. I'll see if skipping does the job |

Splitting to run it independently of the Linux/Windows builds.