Default upgradeJobResources values are too low and get the UpgradeCRD job OOMKilled #515

BaudouinH · 2023-11-02T18:19:19Z

What steps did you take and what happened:

Upgrading the Velero Helm chart from velero-5.1.2 to velero-5.1.3 fails:
the Upgrade CRD job cannot complete due to the pods of the job being OOMKilled when using the chart default resource values.

What did you expect to happen:

The upgrade CRD job using the Chart default resource values should not be OOMKilled.

The output of the following commands will help us better understand what's going on:
(Pasting long output into a GitHub gist or other pastebin is fine.)

Anything else you would like to add:
The job resumed working as intended when I bumped it with the following resource requirements:

upgradeJobResources:
  requests:
    cpu: 100m
    memory: 128Mi
  limits:
    cpu: 1000m
    memory: 512Mi

I am not sure if the OOMKill is due to my environment or because the default values are too low.

I would gladly do the PR if you want, but I am not sure that the values I provided are the best values.

Also, I often hear that setting CPU limits are an anti-pattern and should be avoided, is there a particular reason you set default CPU limit values in this chart instead of making them optional?

Environment:

helm version (use helm version): 3.11.2 (packaged with ArgoCD)
helm chart version and app version (use helm list -n <YOUR NAMESPACE>):
Chart version: velero-5.1.3
App version: v1.12.1
Kubernetes version (use kubectl version):

Client Version: v1.28.2
Kustomize Version: v5.0.4-0.20230601165947-6ce0bf390ce3
Server Version: v1.27.4-gke.900

Kubernetes installer & version:

GKE 1.27.4

Cloud provider or hardware configuration:
Google Cloud Platform
OS (e.g. from /etc/os-release):

Ubuntu 22.04.3 LTS

The text was updated successfully, but these errors were encountered:

reefland · 2023-11-03T12:23:15Z

I only had to bump memory a little....

upgradeJobResources:
  requests:
    cpu: 50m
    memory: 128Mi
  limits:
    cpu: 100m
    memory: 256Mi

I have a small 6 node home lab environment and the default values couldn't handle that.

jenting · 2023-11-10T16:46:46Z

How about remove the request/limit default value within values.yaml?
Leave the configuring request/limit value to the user.

makarov-roman · 2023-11-11T11:16:31Z

It was added here: #514
I personally do not see any benefits having it.

reefland · 2023-11-11T16:30:27Z

It should be commented out and left for the user to enable. If the default is known to be invalid, then bump up the default. No reason to set the default to something known to OOM kill. That's just mean.

jenting · 2023-11-12T02:36:23Z

@reefland Would you mind file a PR to address it?

…something generous that actually works. Revert this once vmware-tanzu/helm-charts#515 is resolved Signed-off-by: Jan Lohage <[email protected]>

…limits .upgradeJobResources defaults are too low (128Mi limit). set them to something generous that actually works. Revert this once vmware-tanzu/helm-charts#515 is resolved (#1168) Signed-off-by: Jan Lohage <[email protected]>

jenting added enhancement New feature or request velero labels Nov 8, 2023

j2L4e mentioned this issue Nov 20, 2023

Temporarily fix upstream issue with velero-upgrade-crds pod's memory limits YAKEcloud/yake#1168

Merged

ishuar mentioned this issue Dec 2, 2023

[velero]: Disable default values for upgradeJobResources #524

Merged

4 tasks

qiuming-best closed this as completed in #524 Dec 4, 2023

jenting assigned BaudouinH Dec 7, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Default upgradeJobResources values are too low and get the UpgradeCRD job OOMKilled #515

Default upgradeJobResources values are too low and get the UpgradeCRD job OOMKilled #515

BaudouinH commented Nov 2, 2023

reefland commented Nov 3, 2023

jenting commented Nov 10, 2023 •

edited

Loading

makarov-roman commented Nov 11, 2023

reefland commented Nov 11, 2023

jenting commented Nov 12, 2023

Default upgradeJobResources values are too low and get the UpgradeCRD job OOMKilled #515

Default upgradeJobResources values are too low and get the UpgradeCRD job OOMKilled #515

Comments

BaudouinH commented Nov 2, 2023

reefland commented Nov 3, 2023

jenting commented Nov 10, 2023 • edited Loading

makarov-roman commented Nov 11, 2023

reefland commented Nov 11, 2023

jenting commented Nov 12, 2023

jenting commented Nov 10, 2023 •

edited

Loading