Propolis panic during zone uninstall #827
Comments
I uploaded an instance of the core files for each propolis zone to
The two instances had been running for some time and most likely didn't have any new processes in them that would have triggered new guest OS-related failures:
Unfortunately, their propolis logs were destroyed as part of the zone uninstall, so all we have for debugging are the core files.
This is unfortunate, yes. I wonder if we could, as part of the parking, force an archive/rotation of the logs? Though if the panic happens at the time of zone uninstall, archiving the logs beforehand is not going to catch it. Still, perhaps things were in an abnormal state that contributed to the panic, and the earlier logs would be a clue. As far as I know, we have not previously seen core files from a running propolis during a rack parking. As a second thing to try, we could spin up instances on a racklette, then park it and see if we can get a panic, perhaps while tailing the propolis log from the global zone. Third, it's possible crucible came off the rails if things were not shut down "properly", i.e. tasks just dying at unexpected moments; while not ideal, that may not be an actual problem given that we are parking the rack. Nonetheless, it should be understood.
Perhaps we ought to add an in-memory ring buffer of recent log records, in such a way that it could be fished out easily from the core file?
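To make the ring buffer idea concrete, here is a minimal sketch of what such a structure might look like. It is not taken from Propolis: the `LogRing` type, the `MAGIC` marker, and the newline-delimited layout are all assumptions made for illustration. The thinking is that a fixed-size, contiguous byte buffer prefixed by a recognizable marker could be located in a core file by searching for the marker, without needing to walk heap structures.

```rust
/// Hypothetical marker to search for in a core file (not a Propolis constant).
const MAGIC: [u8; 8] = *b"LOGRING\0";

/// Fixed-size ring of recent log bytes. `N` must be greater than zero.
/// `repr(C)` keeps the fields in declaration order so the marker, the write
/// counter, and the data sit next to each other in memory.
#[repr(C)]
pub struct LogRing<const N: usize> {
    /// Only read by tooling that inspects the core, never by the program.
    #[allow(dead_code)]
    magic: [u8; 8],
    /// Total bytes ever written; the next write position is `written % N`.
    written: u64,
    data: [u8; N],
}

impl<const N: usize> LogRing<N> {
    pub const fn new() -> Self {
        Self { magic: MAGIC, written: 0, data: [0; N] }
    }

    /// Append one record plus a trailing newline, overwriting the oldest bytes.
    pub fn push(&mut self, record: &str) {
        for &b in record.as_bytes().iter().chain(b"\n") {
            let pos = (self.written % N as u64) as usize;
            self.data[pos] = b;
            self.written += 1;
        }
    }

    /// Return the retained bytes in oldest-to-newest order.
    pub fn snapshot(&self) -> Vec<u8> {
        let n = N as u64;
        if self.written <= n {
            // Buffer has not wrapped yet: everything written is still there.
            self.data[..self.written as usize].to_vec()
        } else {
            // Buffer has wrapped: the oldest byte is at the write position.
            let start = (self.written % n) as usize;
            let mut out = Vec::with_capacity(N);
            out.extend_from_slice(&self.data[start..]);
            out.extend_from_slice(&self.data[..start]);
            out
        }
    }
}
```

In a real service this would presumably sit behind a lock and be fed from the logging pipeline, and the oldest retained record may be cut off mid-line once the buffer wraps. Whether that trade-off is acceptable, and how the buffer would actually be extracted from a core, is left open here.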
Related:
I found two sets of propolis core files on a certain sled after running the "parking" script that halted and uninstalled all zones on the sled.
Here are the core files of two specific propolis zones - there are 5 occurrences for each of them:
The stacks appear identical and here is one of them: