You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
After creating and running a Pod with resource requests, the allocatable device count on the node does not change as expected, even though the Pod is running successfully. Additionally, when attempting to create another Pod requesting the same resources, the new Pod remains in a Pending state.
Actual behavior:
The allocatable resources on the node did not change after the first Pod was running successfully. The second Pod, which requested the same resources, remained in a Pending state, suggesting that the resources were not available, but the allocatable count did not reflect this.
The text was updated successfully, but these errors were encountered:
I don't see any evidence that this is a k3s issue. None of the affected code lives in this project. Have you verified that this only affects k3s?
Most likely you should report this to the upstream Kubernetes project, or Nvidia operator. Probably the latter since that is what is responsible for tracking the GPU resource.
That's said, I don't think that is how the allocatable resources work. If I remember correctly, it is not a measure of what remains after resources currently in use by pods are subtracted. It the total usable by the kubelet, after the kubelet and system reserved resources are subtracted.
You should read the Kubernetes docs for more information on what these fields indicate.
Environmental Info:
K3s Version:
k3s version v1.28.14+k3s1 (3ef2bdb)
Environment="K3S_ARGS= --cluster-cidr=10.44.0.0/16 --disable-network-policy --disable-cloud-controller --disable=local-storage,servicelb,traefik --kubelet-arg=eviction-hard=memory.available<5%,nodefs.available<10% --kubelet-arg=kube-reserved=cpu=200m,memory=250Mi,ephemeral-storage=1Gi --kubelet-arg=system-reserved=cpu=200m,memory=250Mi,ephemeral-storage=1Gi --kube-proxy-arg=proxy-mode=ipvs
Describe the bug:
After creating and running a Pod with resource requests, the allocatable device count on the node does not change as expected, even though the Pod is running successfully. Additionally, when attempting to create another Pod requesting the same resources, the new Pod remains in a Pending state.
pod resources detals :
node details:
Steps To Reproduce:
Expected behavior:
Actual behavior:
The allocatable resources on the node did not change after the first Pod was running successfully. The second Pod, which requested the same resources, remained in a Pending state, suggesting that the resources were not available, but the allocatable count did not reflect this.
The text was updated successfully, but these errors were encountered: