-
Notifications
You must be signed in to change notification settings - Fork 60
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[rawhide][x86_64] : kola tests fail due to systemd pkg upgrade #1857
Comments
Pinned systemd to systemd-257-1.fc42 |
This "init.scope" is very strange. It would seem that journald was not able to figure out how PID1 is called. This may happen for other processes if they die very quickly, but of course this doesn't apply to PID1. I have never seen this before. I looks like EROFS is the original error. There is nothing in v257.2 that looks related. There was a commit in v257.1 (systemd/systemd@1f6e192848). Are you sure that it's a regression between .1 and .2? |
I was made aware that I misread the initial report and it's a regression from 257. So indeed systemd/systemd@1f6e192848 is probably the culprit. The question is what is special in this environment to trigger the issue. |
What's special is likely that we create |
Hmm, if |
Or at least, we can in some scenarios. Though note for Anaconda installs, we run systemd-tmpfiles for a select set of paths before rebooting (because anaconda has Humm though one interesting thing here actually is that while the FCOS container we do have
The qcow2 disk image doesn't somehow:
Which is quite surprising to me because ostree as of recently does automatically copy data from |
In the scenario under test here, the filesystem is writable - So what would need to be handled is ENOENT - which may be what you meant. |
I think this is probably https://bugzilla.redhat.com/show_bug.cgi?id=2334015 ? edit: well, hmm, it's a bit different as you're getting "Read-only file system" not "Permission denied"...probably caused by the same upstream change, though. |
... (rhbz#2334015, coreos/fedora-coreos-tracker#1857)
systemd-257.2-4 has the offending patch reverted. |
I checked the kola tests locally with systemd-257.2-4.fc42 and they seem to PASS. |
Opened a fast-track PR for this - coreos/fedora-coreos-config#3316 |
Should be taken care of with coreos/fedora-coreos-config#3316 |
This is the cause of https://bugzilla.redhat.com/show_bug.cgi?id=2339009 |
I will say here that the lockfile stuff makes total sense in isolation but the only long term sustainable solution is actually to support reverting package builds always. For exactly things like this, systemd should have been reverted everywhere not just pinned in FCOS. Not just because we have other deliverables (e.g. bootc) that aren't pinning (semi intentionally) today but just because pinning in one place creates confusing technical debt that FCOS maintainers need to remember to clean up - which is exactly what happened in this case. So again we should be aiming towards requiring PRs for packages, and having reasonable gating there. |
We had a PR to drop the override that slipped through the cracks apparently. coreos/fedora-coreos-config#3318 I've merged that now. |
openQA would usually have gated something like this, but weirdly the bug was specific to certain environments (Cloud and CoreOS, not sure what the common attribute is?) and we don't run any tests of those on updates, so the tests all passed :/ We do test service startup on server, KDE and workstation environments, so if they'd been affected the update would've been gated. In the event, we only caught it when it landed in a compose and the cloud tests ran. |
Describe the bug
Multiple kola tests in the rawhide build seem to fail because of the systemd pkg upgrade
systemd 257-1.fc42 -> 257.1-1.fc42
Listing few of the kola test failures below
The console log carried messages regarding failure to start Network Name Resolution.
Reproduction steps
cosa fetch && cosa build
Expected behavior
ext.config.toolbox (including other tests) kola tests to pass
Actual behavior
Multiple Kola tests fail with error
[2025-01-02T20:29:19.241Z] harness.go:1823: mach.Start() failed: machine "d8d8cd2b-e47e-428c-92d9-920a3df98f67" failed basic checks: detected failed or stuck systemd units: some systemd units failed: systemd-resolved.service; <nil>
System details
rawhide - x86_64
Butane or Ignition config
No response
Additional information
console.txt
journal.txt
The text was updated successfully, but these errors were encountered: