-
Notifications
You must be signed in to change notification settings - Fork 42
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Omni erroneously removes a live etcd member #750
Comments
FWIW, I also ran into this issue when upgrading from Talos 1.8.1 to 1.8.4. I didn't do anything special, just updated the cluster template and ran
To "fix" it, I rebooted and selected the "Reset installation and reboot into maintenance mode" option for the 100.64.0.77 node. It eventually came back up. I made a support bundle with omnictl before resetting the node. Would it be helpful to send somewhere? |
Did you have any extensions installed on the nodes? |
The logic of the etcd audit got outdated with the more recent Talos versions. `apid` now runs in the states where it wasn't available before, so the check for the etcd member might lead to the false-positives. Also reorder the `auditMember` check sequence to be more correct. Fixes: siderolabs#750 Signed-off-by: Artem Chernyshev <[email protected]>
The logic of the etcd audit got outdated with the more recent Talos versions. `apid` now runs in the states where it wasn't available before, so the check for the etcd member might lead to the false-positives. Also reorder the `auditMember` check sequence to be more correct. Fixes: siderolabs#750 Signed-off-by: Artem Chernyshev <[email protected]>
The logic of the etcd audit got outdated with the more recent Talos versions. `apid` now runs in the states where it wasn't available before, so the check for the etcd member might lead to the false-positives. Also reorder the `auditMember` check sequence to be more correct. Fixes: #750 Signed-off-by: Artem Chernyshev <[email protected]> (cherry picked from commit 82da2f4)
I've identified the root cause. The fix has landed in 0.45.1. |
#745
The text was updated successfully, but these errors were encountered: