-
-
Notifications
You must be signed in to change notification settings - Fork 122
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Multiple CI jobs broken on master
#749
Comments
Ah, sorry, I should've checked more carefully - it seems like these tests have been broken for a while now, so nothing to get alarmed about. Still, at least some of those errors seem legit (e.g., the first one complaining about a missing symbol) |
Yeah, seems most of the AMDGPU CI fails but still tests status shows success. |
That's a crash in building AMDGPU.jl
That's #725.
I think that's a bug in MVAPICH which may have been fixed in a later version. |
More specifically, the problem is that an old version of AMDGPU.jl (0.2.17) is picked up, despite Line 22 in 4ba36fa
This should have been fixed in newer versions. |
I'll check more carefully but to me it seems that none of the AMDGPU CI succeeds. |
AMDGPU.jl 0.3 isn't compatible with Julia v1.6. I don't see how we can possibly make it work. We should either drop support for Julia v1.6 or not use AMDGPU at all. |
Call me naive, but I don't see how anyone still uses MPI.jl + Julia v1.6 for productive workloads. For the sake of reducing maintenance efforts, I'd thus vote for dropping v1.6 support. If this is not possible/desirable, then disable AMDGPU support for Julia v1.6 by adding a corresponding statement to the docs and removing the corresponding test matrix entries. |
I'd say that AMDGPU Ext + MPI should be bound to Julia 1.9 & AMDGPU 0.5 as explained in #753 |
AMDGPU is always a test dependency: Line 2 in abebf65
|
I think we have now cleared most of if not all the persistent failures by either addressing them or skipping consistently failing tests, which are already tracked by specific issues. If there are other relevant issues it'd be good to open dedicated issues, but I'm going to close this ticket, as CI should now be in a much better shape. |
It seems like multiple CI jobs are broken, e.g.,
ERROR: LoadError: UndefVarError: libhsa_runtime64 not defined
(upstream bug in AMDGPU.jl 0.2 with Julia v1.6, but AMDGPU v0.3 requires Julia v1.7). It doesn't seem to occur anymore.MPIR_Attr_delete_list
spack/spack#39058)get_position_shared
#555). Test skipped in Skip threads tests which are known to fail #791Any idea what's going on here?
The text was updated successfully, but these errors were encountered: