Skip to content

Commit

Permalink
Update docs-gb/kubernetes/hpa-rps-autoscaling.md
Browse files Browse the repository at this point in the history
  • Loading branch information
lc525 authored Dec 4, 2024
1 parent fb45bd1 commit e829041
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion docs-gb/kubernetes/hpa-rps-autoscaling.md
Original file line number Diff line number Diff line change
Expand Up @@ -101,7 +101,7 @@ data:
{% endcode %}

In this example, a single rule is defined to fetch the `seldon_model_infer_total` metric
from Prometheus, compute its rate over a 1 minute window, and expose this to k8s as the `infer_rps`
from Prometheus, compute its rate over a 1 minute window, and expose this to Kubernetes as the `infer_rps`
metric, with aggregations available at model, server, inference server pod and namespace level.

When HPA requests the `infer_rps` metric via the custom metrics API for a specific model,
Expand Down

0 comments on commit e829041

Please sign in to comment.