dsc failed to honor a client stop request #1390
Comments
On the buildomat instance, we can see that the downstairs processes are all running:
And, looking at the logs for both 8810 and 8820, they look to be the same:
Which suggests that the downstairs did start up correctly.
So, this is odd, from
We see that log message for all downstairs
But we only see this second log message for clients 1, 2, and 3, not for client 0, even though the process listing does show downstairs client 0 (8810) as running.
A pstack on the buildomat host does show one thread waiting on a
Same issue here: #1498
With the update to 1.84, we should no longer see this.
In a CI run, test-up-encrypted.sh hung while doing the replace-reconcile test
(original CI link, may be broken: https://github.com/oxidecomputer/crucible/pull/1387/checks?check_run_id=27758312216)
A copy of the logs from the buildomat instance is here: /staff/tmp/buildomat-crucible-20240722/
Looking at the dsc log:
Normal so far: we are starting up 4 downstairs at the given locations.
However, only three downstairs make it to running; we never get a message for 8810 (client 0).
A client requests a stop of 0:
But dsc does not stop it (I suspect that, because it is not running, dsc has no action to take).
The client then comes back and keeps asking for the state, waiting for it to be stopped:
This stopped state never happens, so the test loops forever.
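Independent of the dsc fix, the client-side wait could be bounded so that a missed state change fails the test instead of hanging CI. A minimal sketch, assuming a hypothetical `get_state` callable that stands in for however the test queries dsc for a client's state; the function name, the state strings, and the timeout values are illustrative and not taken from dsc or the test script:

```python
import time


def wait_for_state(get_state, want, timeout_s=30.0, interval_s=0.5,
                   clock=time.monotonic, sleep=time.sleep):
    """Poll get_state() until it returns `want` or timeout_s elapses.

    get_state is a hypothetical stand-in for the real state query.
    Returns True if the wanted state was observed, False on timeout.
    """
    deadline = clock() + timeout_s
    while True:
        if get_state() == want:
            return True
        if clock() >= deadline:
            # Give up instead of looping forever; the caller can fail the test.
            return False
        sleep(interval_s)


if __name__ == "__main__":
    # Simulated client that never reaches "stopped", as in this issue.
    ok = wait_for_state(lambda: "starting", "stopped",
                        timeout_s=1.0, interval_s=0.2)
    print("reached stopped state" if ok else "timed out waiting for stop")
```

With a bound like this, the hang described above would surface as a test failure with a clear message rather than a CI job that runs until it is killed.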