A Helm chart for Spotfire® Service for Python
Homepage: https://github.com/spotfiresoftware/spotfire-cloud-deployment-kit
Kubernetes: >=1.24.0-0
Repository | Name | Version |
---|---|---|
file://../spotfire-common | spotfire-common | 0.3.0 |
This chart deploys the Spotfire® Service for Python service (Python service) on a Kubernetes cluster using the Helm package manager.
The Python service pod includes:
- A Fluent Bit sidecar container for log forwarding.
- Service annotations for Prometheus scrapers. The Prometheus server discovers the service endpoint using these specifications and scrapes metrics from the exporter.
- A predefined configuration for horizontal pod autoscaling with KEDA and Prometheus.
This chart is tested to work with Elasticsearch, Prometheus and KEDA.
- A deployed Spotfire Server release using the Spotfire Server chart.
- Export the
SPOTFIRE_SERVER
value to connect to thespotfire-server
service:export SPOTFIRE_SERVER=$(kubectl get services --selector=app.kubernetes.io/part-of=spotfire,app.kubernetes.io/name=spotfire-server --output=jsonpath={.items..metadata.name})
- Forward the logs to the
log-forwarder
service:export LOG_FORWARDER=$(kubectl get services --selector=app.kubernetes.io/part-of=spotfire,app.kubernetes.io/name=log-forwarder --output=jsonpath={.items..metadata.name})
- Install this chart with the release name
my-release
and custom values frommy-values.yaml
:helm install my-release . \ --set acceptEUA=true \ --set global.spotfire.image.registry="127.0.0.1:32000" \ --set global.spotfire.image.pullPolicy="Always" \ --set nodemanagerConfig.serverBackendAddress="$SPOTFIRE_SERVER" \ --set logging.logForwarderAddress="$LOG_FORWARDER" \ -f my-values.yaml
Note: This Spotfire Helm chart requires setting the parameter acceptEUA
(or the parameter global.spotfire.acceptEUA
) to the value true
.
By doing so, you agree that your use of the Spotfire software running in the managed containers will be governed by the terms of the Cloud Software Group, Inc. End User Agreement.
Note: You must provide your private registry address where the Spotfire container images are stored.
See helm install for command documentation.
Configure volumes.packages
to mount a persistent volume, which can contain additional Python packages for the Python service to use. See the Values section for more information on mounting the volume for the packages.
You can populate a packages folder by following the instructions in Installing Python Packages Manually. Here is an example of how to create a packages folder from a requirements.txt file:
python -m pip install --target=/local/path/to/packages -r requirements.txt
Note: For spotfire-pythonservice to use the packages, the packages must be copied to a PersistentVolume that you create and then provide to the helm chart during installation.
The file deploy-packages-to-pv-with-job.yaml is an example of how to use a Kubernetes Job, PersistentVolumeClaim, and ConfigMap to create and populate a PersistentVolume containing Python packages.
You might need to change some values in the file. For example, change the storageClassName for PersistentVolumeClaim to StorageClass from 'nfs-client' to one that exists in your environment. The full set of commands would look something like the following example.
# kubectl will create the PersistentVolumeClaim 'packages-pvc' pointing to PersistentVolume containing the installed Python packages.
kubectl apply . -f deploy-packages-to-pv-with-job.yaml
# When you install the spotfire-pythonservice Helm chart, pass in packages-pvc.
helm install my-release --set volumes.packages=packages-pvc <... additional helm install arguments>
To set Custom configuration properties, add the name of the property as a key under the configuration
section in your Helm values.
Example:
# The maximum number of Python engine sessions that are allowed to run concurrently in the Python service.
engine.session.max: 5
# The number of Python engines preallocated and available for new sessions in the Python service queue.
engine.queue.size: 10
To uninstall/delete the my-release
deployment:
helm uninstall my-release
See helm uninstall for command documentation.
For scaling the my-release
deployment, do a helm upgrade, providing the target number of pod instances in the replicaCount
variable.
helm upgrade --install my-release . --reuse-values --set replicaCount=3
To use KEDA for autoscaling, first install KEDA in the Kubernetes cluster. You must also install a Prometheus instance that scrapes metrics from the Spotfire pods.
Example: A values.yaml
snippet configuration for enabling autoscaling with KEDA:
resources:
limits:
cpu: 5
kedaAutoscaling:
enabled: true
spotfireConfig:
prometheusServerAddress: http://prometheus-server.monitor.svc.cluster.local
threshold: 3
minReplicas: 1
maxReplicas: 3
The spotfire-pythonservice
has the following autoscaling defaults:
- metric:
spotfire_service_queue_engines_inUse
(serviceQueueEnginesInUse Python service counter). - query: the sum of
spotfire_service_queue_engines_inUse
of the Python service instances for the release name.
The counter serviceQueueEnginesInUse provides the total number of engines currently executing.
By default, the Python service has number of cores - 1
available slots, which means that kedaAutoscaling.threshold
should be synchronized with resources.limits.cpu
.
Typically, you want to scale out before all the available capacity is taken.
Therefore, the kedaAutoscaling.threshold
should be lower than resources.limits.cpu
.
Note that clients requesting a slot typically wait until a slot is available.
For more information, see Monitoring Spotfire Service for Python using JMX.
Note: You can tune nodemanagerConfig.preStopDrainingTimeoutSeconds
and other timeouts (for example, engine.execution.timeout
and engine.session.maxtime
) so that long-running jobs are not aborted prematurely when an instance is stopped to scale in.
See Engine Timeout for more details.
For more advanced scenarios, see kedaAutoscaling.advanced and kedaAutoscaling.fallback.
Additionally, you can define your own custom scaling triggers. Helm template functionality is available:
kedaAutoscaling:
triggers:
# {list of triggers to activate scaling of the target resource}
Note: For more details on the autoscaling defaults, see the keda-autoscaling.yaml template.
See helm upgrade for command documentation.
When you upgrade to a newer Spotfire Server version and newer Spotfire services versions, upgrade the Spotfire Server first, and then upgrade the Spotfire services.
Some parameters might have been changed, moved or renamed and must be taken into consideration when upgrading the release. See release notes for more information.
Key | Type | Default | Description |
---|---|---|---|
global.spotfire.acceptEUA | bool | nil |
Accept the Cloud Software Group, Inc. End User Agreement by setting the value to true . |
global.spotfire.image.pullPolicy | string | "IfNotPresent" |
The global container image pull policy. |
global.spotfire.image.pullSecrets | list | [] |
The global container image pull secrets. |
global.spotfire.image.registry | string | nil |
The global container image registry. Used for spotfire/ container images, unless it is overridden. |
acceptEUA | bool | nil |
Accept the Cloud Software Group, Inc. End User Agreement by setting the value to true . |
affinity | object | {} |
|
configuration | object | {} |
Add Custom configuration properties. Keys under configuration should be named the same as the configuration property, for example 'engine.execution.timeout'. |
extraContainers | list | [] |
Additional sidecar containers to add to the service pod. |
extraEnvVars | list | [] |
Additional environment variables. |
extraEnvVarsCM | string | "" |
The name of the ConfigMap containing additional environment variables. |
extraEnvVarsSecret | string | "" |
The name of the Secret containing extra additional environment variables. |
extraInitContainers | list | [] |
Additional init containers to add to the service pod. |
extraVolumeMounts | list | [] |
Extra volumeMounts for the service container. More info: kubectl explain deployment.spec.template.spec.containers.volumeMounts . |
extraVolumes | list | [] |
Extra volumes for the service container. More info: kubectl explain deployment.spec.template.spec.volumes . |
fluentBitSidecar.image.pullPolicy | string | "IfNotPresent" |
The image pull policy for the fluent-bit logging sidecar image. |
fluentBitSidecar.image.repository | string | "fluent/fluent-bit" |
The image repository for fluent-bit logging sidecar. |
fluentBitSidecar.image.tag | string | "3.1.6" |
The image tag to use for fluent-bit logging sidecar. |
fluentBitSidecar.securityContext | object | {} |
The securityContext setting for fluent-bit sidecar container. Overrides any securityContext setting on the Pod level. |
fullnameOverride | string | "" |
|
image.pullPolicy | string | nil |
The spotfire-server image pull policy. Overrides global.spotfire.image.pullPolicy. |
image.pullSecrets | list | [] |
Image pull secrets. |
image.registry | string | nil |
The image registry for spotfire-server. Overrides global.spotfire.image.registry value. |
image.repository | string | "spotfire/spotfire-pythonservice" |
The spotfire-server image repository. |
image.tag | string | "1.21.1-1" |
The container image tag to use. |
kedaAutoscaling | object | {"advanced":{},"cooldownPeriod":300,"enabled":false,"fallback":{},"maxReplicas":4,"minReplicas":1,"pollingInterval":30,"spotfireConfig":{"prometheusServerAddress":"http://prometheus-server.monitor.svc.cluster.local"},"threshold":null,"triggers":[]} |
KEDA autoscaling configuration. See https://keda.sh/docs/latest/concepts/scaling-deployments for more details. |
kedaAutoscaling.cooldownPeriod | int | 300 |
The period to wait after the last trigger reported active before scaling the resource back to 0. |
kedaAutoscaling.maxReplicas | int | 4 |
This setting is passed to the HPA definition that KEDA creates for a given resource and holds the maximum number of replicas of the target resource. |
kedaAutoscaling.minReplicas | int | 1 |
The minimum number of replicas KEDA scales the resource down to. |
kedaAutoscaling.pollingInterval | int | 30 |
The interval to check each trigger on. |
kedaAutoscaling.spotfireConfig | object | {"prometheusServerAddress":"http://prometheus-server.monitor.svc.cluster.local"} |
Spotfire specific settings. |
kedaAutoscaling.spotfireConfig.prometheusServerAddress | string | "http://prometheus-server.monitor.svc.cluster.local" |
REQUIRED. The URL for the Prometheus server from where metrics are fetched. |
livenessProbe.enabled | bool | true |
|
livenessProbe.failureThreshold | int | 10 |
|
livenessProbe.httpGet.path | string | "/spotfire/liveness" |
|
livenessProbe.httpGet.port | string | "registration" |
|
livenessProbe.initialDelaySeconds | int | 60 |
|
livenessProbe.periodSeconds | int | 3 |
|
logging.logForwarderAddress | string | "" |
The spotfire-server log-forwarder name. Template. |
logging.logLevel | string | "debug" |
Set to debug , trace , minimal , or leave empty for info. This applies for both node manager and the service. |
nameOverride | string | "" |
|
nodeSelector | object | {} |
|
nodemanagerConfig.preStopDrainingTimeoutSeconds | int | 610 |
The draining timeout after which the service is forcefully shut down. |
nodemanagerConfig.serverBackendAddress | string | "" |
The spotfire-server service name. This value is evaluated as a helm template. |
podAnnotations."prometheus.io/path" | string | "/spotfire/metrics" |
|
podAnnotations."prometheus.io/port" | string | "9080" |
|
podAnnotations."prometheus.io/scrape" | string | "true" |
|
podSecurityContext | object | {} |
The Pod securityContext setting applies to all of the containers inside the Pod. |
readinessProbe.enabled | bool | false |
|
readinessProbe.failureThreshold | int | 10 |
|
readinessProbe.httpGet.path | string | "/spotfire/readiness" |
|
readinessProbe.httpGet.port | string | "registration" |
|
readinessProbe.initialDelaySeconds | int | 60 |
|
readinessProbe.periodSeconds | int | 3 |
|
replicaCount | int | 1 |
|
resources | object | {} |
|
securityContext | object | {} |
The securityContext setting for the service container. Overrides any securityContext setting on the Pod level. |
service.port | int | 9501 |
|
service.type | string | "ClusterIP" |
|
serviceAccount.annotations | object | {} |
|
serviceAccount.create | bool | false |
|
serviceAccount.name | string | "" |
|
startupProbe.enabled | bool | true |
|
startupProbe.failureThreshold | int | 20 |
|
startupProbe.httpGet.path | string | "/spotfire/started" |
|
startupProbe.httpGet.port | string | "registration" |
|
startupProbe.initialDelaySeconds | int | 60 |
|
startupProbe.periodSeconds | int | 3 |
|
tolerations | list | [] |
|
volumes.packages.existingClaim | string | "" |
When 'persistentVolumeClaim.create' is 'false', then use this value to define an already existing persistent volume claim. |
volumes.packages.persistentVolumeClaim.create | bool | false |
If 'true', then a 'PersistentVolumeClaim' is created. |
volumes.packages.persistentVolumeClaim.resources | object | {"requests":{"storage":"1Gi"}} |
Specifies the standard Kubernetes resource requests and/or limits for the customExt volume claims. |
volumes.packages.persistentVolumeClaim.storageClassName | string | "" |
Specifies the name of the 'StorageClass' to use for the customExt volume-claim. |
volumes.packages.persistentVolumeClaim.volumeName | string | nil |
Specifies the name of the persistent volume that should be used for the customExt volume-claim. |
volumes.packages.subPath | string | "" |
The subPath of the volume to be used for the volume mount |
volumes.troubleshooting.existingClaim | string | "" |
When 'persistentVolumeClaim.create' is 'false', then use this value to define an already existing persistent volume claim. |
volumes.troubleshooting.persistentVolumeClaim.create | bool | false |
If 'true', then a 'PersistentVolumeClaim' will be created. |
volumes.troubleshooting.persistentVolumeClaim.resources | object | {"requests":{"storage":"2Gi"}} |
Specifies the standard Kubernetes resource requests and/or limits for the volumes.troubleshooting claims. |
volumes.troubleshooting.persistentVolumeClaim.storageClassName | string | "" |
Specifies the name of the 'StorageClass' to use for the volumes.troubleshooting-claim. |
volumes.troubleshooting.persistentVolumeClaim.volumeName | string | nil |
Specifies the name of the persistent volume to use for the volumes.troubleshooting-claim. |