[Autoscaler][V2] Use running node instances to rate-limit upscaling #50414

ryanaoleary · 2025-02-11T06:44:16Z

Why are these changes needed?

This PR changes the V2 autoscaler's scale-up logic to count instances in the IMInstance.RAY_RUNNING state (rather than IMInstance.REQUESTED instances plus instances that have an IMInstance.cloud_instance_id) when computing the number of nodes to launch per node type. This keeps the handling of upscaling_speed consistent with the V1 autoscaler and with the Ray docs, which describe it as the factor by which the number of nodes in the cluster may scale, with a minimum of 5 nodes.
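
To illustrate the rate limit described above, here is a minimal sketch of how a per-node-type launch cap could be derived from the number of running nodes. It is not the code in this PR: the function name `max_nodes_to_launch`, the constant `MIN_UPSCALE_NODES`, and the choice to subtract already-pending launches are assumptions made for illustration only.

```python
# Hypothetical sketch; names and structure are NOT taken from the Ray codebase.

# Assumed floor, based on the "minimum of 5" mentioned in the description.
MIN_UPSCALE_NODES = 5


def max_nodes_to_launch(
    num_running: int, num_pending: int, upscaling_speed: float
) -> int:
    """Return how many more nodes of one type may be launched right now.

    num_running: nodes of this type that are already running Ray.
    num_pending: launches of this type already in flight (assumed to
        count against the cap).
    upscaling_speed: allowed pending nodes as a multiple of running nodes.
    """
    # The cap grows with the running count but never drops below the floor,
    # so an empty cluster can still start scaling up.
    allowed_pending = max(
        MIN_UPSCALE_NODES, int(upscaling_speed * max(num_running, 1))
    )
    # Launches already in flight use up part of the cap.
    return max(0, allowed_pending - num_pending)
```

Under these assumptions, a node type with 20 running nodes and `upscaling_speed=1.0` could have up to 20 launches in flight, while a node type with no running nodes would still be allowed 5.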

Related issue number

Closes #50259

Checks

- I've signed off every commit (by using the -s flag, i.e., git commit -s) in this PR.
- I've run scripts/format.sh to lint the changes in this PR.
- I've included any doc changes needed for https://docs.ray.io/en/master/.
  - I've added any new APIs to the API Reference. For example, if I added a
    method in Tune, I've added it in doc/source/tune/api/ under the
    corresponding .rst file.
- I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
- Testing Strategy
  - Unit tests
  - Release tests
  - This PR is not tested :(

Signed-off-by: Ryan O'Leary <[email protected]>

ryanaoleary and others added 5 commits February 11, 2025 06:42

Use running instances to rate-limit upscaling

699d88a

Signed-off-by: Ryan O'Leary <[email protected]>

Merge branch 'master' into update-v2-scale-up

3677008

Add import

f9bf403

Signed-off-by: Ryan O'Leary <[email protected]>

remove unused var

934dfe7

Signed-off-by: Ryan O'Leary <[email protected]>

Fix naming

1d8aa48

Signed-off-by: Ryan O'Leary <[email protected]>

ryanaoleary marked this pull request as ready for review February 11, 2025 08:36

ryanaoleary requested review from hongchaodeng and a team as code owners February 11, 2025 08:36

edoakes requested a review from kevin85421 February 11, 2025 13:59

jcotant1 added the core Issues that should be addressed in Ray Core label Feb 11, 2025
