-
-
Notifications
You must be signed in to change notification settings - Fork 5.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix CI hangs on win32 #34225
Fix CI hangs on win32 #34225
Conversation
After much debugging, it turns out that OpenBlas is sometimes hanging on exit due to bad assumptions in its thread management. The upstream discussion is at OpenMathLib/OpenBLAS#2350, but this should fix our frequent win32 CI failures in the meantime.
Looks like the BB OpenBLAS was upgrade to 0.3.7, but we never imported that into Julia? 0.3.7 needs a rebased patch. What's the right way to proceed here? @staticfloat |
This is the patch from JuliaLang/julia#34225, but against 0.3.7 not 0.3.5.
This is the patch from JuliaLang/julia#34225, but against 0.3.7 not 0.3.5.
Bump. Should we merge and backport this, then separately upgrade openblas to 0.3.7 and replace the patch? |
@staticfloat what's the status of the 0.3.7 binaries that have this patch? |
They are a part of #34242 (I've also carried your patch for a local from-source build there) |
I'm building you a 0.3.5 version with your patch so that we can land this on 1.3 and 1.4 without needing to change the rest of the BB binaries. It's going to require some manual renaming of tarballs, but should be done pretty soon. JuliaPackaging/Yggdrasil#356 |
Should we do a quick 1.3.2 with this? |
We should include it if we do 1.3.2, but not worth it to do a release just for this. |
People have reported hangs in their windows CI on 1.3, which I thought this might fix. |
I doubt it. It's pretty hard to trigger. The only reason Julia managed to is that we spawn hundreds of julia processes in short succession and immediately kill them. |
Ok. It may be a different win32 issue then. Could you take a quick look at Circuitscape/Omniscape.jl#13 |
Somebody will have to debug that. I don't think it looks like this issue because this issue causes julia to hang on exit (unless of course it's creating additional processes while it goes along). My suggestion would be to use AppVeyor's rdp feature and try to attach a debugger or generate a minidump. |
It's only happening on Travis Windows, IIUC. |
We do need a 1.3.2 with a new Pkg anyway, so it should be on our radar. |
After much debugging, it turns out that OpenBlas is sometimes hanging
on exit due to bad assumptions in its thread management. The upstream
discussion is at OpenMathLib/OpenBLAS#2350, but this
should fix our frequent win32 CI failures in the meantime.