This doc provides best practices for writing Kubeflow kustomize packages.
Table of Contents
- Layout package to support composability
- Reuse patches
- Command Line substitution
- Eschew vars
- Have separate packages for CR's and instances of the custom resource
- CommonLabels should be immutable
- Removing common attributes across resources
If your application consists of loosely coupled components e.g. backend, front-end, database consider defining these as separate kustomize packages and then using kustomize to compose these applications into different installs e.g
components/
/app-front
/app-backend
/app-db
installs/
/app-standalone
/app-onprem
Defining separate packages for each component makes it easier to use composition to define new configurations; e.g. using an external database as opposed to a database running in cluster.
We encourage reusing patches across kustomize packages when it makes sense. For example suppose we have an onprem and standalone version of our application but both of them want to reuse a common patch to use an external database. We could lay the packages out like so
components/
/patches/
/deployment-external-db.yaml
installs/
/app-standalone
/app-onprem
The kustomization files for app-standalone could then look like the following
apiVersion: kustomize.config.k8s.io/v1beta1
...
patchesStrategicMerge:
- ../../components/patches/deployment-external-db.yaml
To support the above layout we need to disable kustomizes' security check by running with the load_restrictor
flag:
kustomize build --load_restrictor none $target
To make it easy for users to override command line arguments use the following pattern.
- Use a config map generator to store the parameters
- On Deployments/StatefulSets/etc... set environment variables based on the config map
- Rely on Kubernetes to substitute environment variables into container arguments (ref)
Users can then override the parameters by defining config map overlays.
Using a ConfigMapGenerator and including a content hash is highly prefered over not including a content hash. Using a content hash ensures that rolling updates are triggered if the config map is changed.
Deprecated patterns
- vars should no longer be used to do command line substitution see bit.ly/kf_kustomize_v3
As noted in kubernetes-sigs/kustomize#2052 vars have a lot of downsides. For Kubeflow in particular vars have made it difficult to compose kustomize packages because they need to be unique globally (kubeflow/manifests#1007).
Vars should be used sparingly. Below are some guidance on acceptable use cases.
One ok use case for vars is getting kustomize to subsitute a value into a field kustomize wouldn't normally do substitution into. This often happens with CRDs. For example, consider the virtual service below from jupyter-web-app.
apiVersion: networking.istio.io/v1alpha3
kind: VirtualService
metadata:
name: jupyter-web-app
spec:
gateways:
- kubeflow-gateway
hosts:
- '*'
http:
- ...
route:
- destination:
host: jupyter-web-app-service.$(jupyter-web-app-namespace).svc.$(clusterDomain)
port:
number: 80
We would like kustomize to substitute namespace into the destination host. We do this by
- Defining a vars to get the value for namespace
- Defining a custom configuration so that the vars will be substituted into the virtual service host.
This use of vars is acceptable because the var is internal to the kustomize package and can be given a unique enough name to prevent conflicts when the package is composed with other applications.
One of the most problematic use cases for vars in Kubeflow today is substituting a user supplied value into multiple applications.
Currently we only have one use case which is substituting in cluster domain into virtual services (ref).
We would ultimately like to get rid of the use of vars in these cases but have not settled on precise solutions. Some possible options are
-
Using kpt setters
- kpt is still relatively new and we don't want to mandate/require using it
- consider adding kpt setters as appropriate so users who are willing to use kpt can avoid dealing with vars
-
Defining custom transformers
- e.g. we could define a new transformer for virtual services as discussed in kubeflow/manifests#1007
If you are adding a custom resource (e.g. CertManager) and also defining instances of those resources (e.g. ClusterIssuer) these should be done in separate kustomize packages (see kubeflow/manifests#1121).
Having separate packages makes it easier during deployment to ensure the custom resource is deployed and ready before trying to create instances of the CR.
As noted here commonLabels get applied to selectors which are immutable. Therefore, commonLabels should be immutable across versions of a package to avoid causing problems during upgrades.
For more info see kubeflow/manifests#1131
Resources should be organized by kind, where the resource is in a file that is the lower-case hyphenized form of the Resource kind. For example: a Deployment would go in a file named deployment.yaml. A ClusterRoleBinding would go in a file called cluster-role-binding.yaml. If there are multiple resources within a kustomize target (eg more than one deployment), you may want to maintain a single resource per file and add a prefix|suffix of the resource name to the filename. For example the file name would be <kind>-<name>.yaml
. See below for an example.
example: /manifests/profiles
profiles
└── base
├── README.md
├── cluster-role-binding.yaml
├── crd.yaml
├── deployment.yaml
├── kustomization.yaml
├── role-binding.yaml
├── role.yaml
├── service-account.yaml
└── service.yaml
There are often repeated attributes across resources: labels, namespace, or perhaps a common prefix used for each resource. You can move name prefixes into the kustomization.yaml file and then make adjustments within each resource; removing the prefix from its name. Additionaly you can move labels and their selectors into the kustomization.yaml. Yo can move the namespace into the kustomization.yaml. All of these will be added back into the resource by running kustomize build
.
example: /manifests/profiles/base/kustomization.yaml. Contains namespace, nameprefix, commonLabels.
apiVersion: kustomize.config.k8s.io/v1beta1
kind: Kustomization
resources:
- crd.yaml
- service-account.yaml
- cluster-role-binding.yaml
- role.yaml
- role-binding.yaml
- service.yaml
- deployment.yaml
namespace: kubeflow
namePrefix: profiles-
commonLabels:
kustomize.component: profiles
images:
- name: gcr.io/kubeflow-images-public/profile-controller
newName: gcr.io/kubeflow-images-public/profile-controller
newTag: v20190228-v0.4.0-rc.1-192-g1a802656-dirty-f95773
The original deployment in profiles looked like:
apiVersion: apps/v1
kind: Deployment
metadata:
labels:
kustomize.component: profiles
name: profiles-deployment
namespace: kubeflow
spec:
selector:
matchLabels:
kustomize.component: profiles
template:
metadata:
labels:
kustomize.component: profiles
spec:
containers:
- command:
- /manager
image: gcr.io/kubeflow-images-public/profile-controller:v20190228-v0.4.0-rc.1-192-g1a802656-dirty-f95773
imagePullPolicy: Always
name: manager
serviceAccountName: profiles-controller-service-account
Moving labels, namespace and the nameprefix 'profiles-' to kustomization.yaml reduces deployment.yaml to
apiVersion: apps/v1
kind: Deployment
metadata:
name: deployment
spec:
template:
spec:
containers:
- name: manager
command:
- /manager
image: gcr.io/kubeflow-images-public/profile-controller:v20190228-v0.4.0-rc.1-192-g1a802656-dirty-f95773
imagePullPolicy: Always
serviceAccountName: controller-service-account
Note: A kustomize target should always 'build', so you should add what's needed to allow a kustomize build
to succeed (and for unittests to work). Defining a namespace in kustomization.yaml is required to run kustomize build
, even though there is a namespace override in the parent kustomization.yaml generated by kfctl under /manifests/profiles. This generated kustomization.yaml provides overrides using values from app.yaml and will appear within the manifest cache after running kfctl generate...
.