Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Question] How to recover from provision state failed at Node Pool #4674

Open
jkroepke opened this issue Nov 27, 2024 · 4 comments
Open

[Question] How to recover from provision state failed at Node Pool #4674

jkroepke opened this issue Nov 27, 2024 · 4 comments
Labels

Comments

@jkroepke
Copy link

Describe scenario
We are running multiple AKS and we have weekly automatic patching enabled.

Some of our critical pod have a PDB configured.

In some rate condition, the automatic update failed. The reason is that there is a timeout. A configured PDB, deny to drain a node.

Question

To recover from that situation, i have to manually restart the pod. Thats fine.

However. The node pool remains in Failed state, including the extra nodes remains as well.

How I can recover from that state? How I can re-trigger the automatic update?

One solution is manually delete the old VM from the VMSS. But thats kinda tricky on Node Pools which large amount of nodes.

@JoeyC-Dev
Copy link

JoeyC-Dev commented Dec 2, 2024

az aks update -n $aks -g $rG

No other arguments/parameters.

@jkroepke
Copy link
Author

jkroepke commented Dec 2, 2024

So is no Portal Experience, right? I can't trigger update at portal, if its on the latest version.

@JoeyC-Dev
Copy link

So is no Portal Experience, right? I can't trigger update at portal, if its on the latest version.

From document, yes. Maybe someone else knows it is hidden somewhere.
Image

https://learn.microsoft.com/troubleshoot/azure/azure-kubernetes/availability-performance/cluster-node-virtual-machine-failed-state

@jkroepke
Copy link
Author

jkroepke commented Dec 2, 2024

Thanks, I will try that on next incident!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants