Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Serve] Enable multiple ports in SkyServe replicas #4356

Open
wants to merge 9 commits into
base: master
Choose a base branch
from

Conversation

Conless
Copy link
Contributor

@Conless Conless commented Nov 14, 2024

Current implementation of SkyServe only allows the replicas to expose one port. In some cases, services may need to expose multiple ports for custom controller, GUI interface, etc. This PR adds support for it by allowing multiple ports, but requiring user to specify one as the main port in the service section, if multiple ports are provided.

service:
  readiness_probe: /
  replicas: 2
  port: 8000

resources:
  ports:
  - 8000-8001
  - 3000
  cpus: 2

The output of sky serve status will look like:

Services
NAME              VERSION  UPTIME  STATUS  REPLICAS  ENDPOINT            
sky-service-xxxx  1        28s     READY   2/2       xxxx:30001  

Service Replicas
SERVICE_NAME      ID  VERSION  ENDPOINT             LAUNCHED     RESOURCES       STATUS  REGION     
sky-service-xxxx  1   1        http://xxxx:8000     50 secs ago  1x AWS(vCPU=2)  READY   us-east-1  
sky-service-xxxx  2   1        http://xxxx:8000     1 min ago    1x AWS(vCPU=2)  READY   us-east-1 

while the other ports (8001, 3000) are still accessible.

Tested (run the relevant ones):

  • Code formatting: bash format.sh
  • Any manual or new tests for this PR (please specify below)
    • sky serve up, sky serve status and accessing the service with config:
    service:
      readiness_probe: /
      replicas: 2
      port: 8000
    
    resources:
      ports:
      - 8000-8001
      - 3000
      cpus: 2
    
    run: |
      python -m http.server 8000 &
      python -m http.server 8001 &
      python -m http.server 3000 &
  • All smoke tests: pytest tests/test_smoke.py
  • Relevant individual smoke tests: pytest tests/test_smoke.py::test_fill_in_the_name
  • Backward compatibility tests: conda deactivate; bash -i tests/backward_compatibility_tests.sh

@Conless
Copy link
Contributor Author

Conless commented Nov 14, 2024

Hi @cblmemo ! Would you like to have a look at this?

Copy link
Collaborator

@cblmemo cblmemo left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for adding this feature @Conless ! Mostly LGTM. Left a discussion.

Also we might want to update related documents for this :))

sky/cli.py Outdated Show resolved Hide resolved
@cblmemo
Copy link
Collaborator

cblmemo commented Nov 18, 2024

Can we also update the PR description?

sky/cli.py Outdated Show resolved Hide resolved
@cblmemo
Copy link
Collaborator

cblmemo commented Nov 18, 2024

Also, it will be great if we can test on some real world usage (e.g. deploy an LLM service and expose metric port in vLLM or dashboard in ray

sky/cli.py Show resolved Hide resolved
@cblmemo
Copy link
Collaborator

cblmemo commented Jan 14, 2025

@Conless Lets change the API to:

service:
  ports: xxx

So in the future, this can be extended to:

service:
  ports:
    - 8000,9000 # Indicating there will be one LB for all IP + Port pairs, if Port == 8000 or 9000
    - 10000-10100,7888 # if port in [10000,10100] or port == 7888
    - 7000 # Indicating a LB for all IP:7000

Could you help make this API change? Also, it will be great to add a smoke test, update the related doc page and resolve the merge conflict. After that this should be ready to go!

@Conless
Copy link
Contributor Author

Conless commented Jan 15, 2025

@Conless Lets change the API to:

service:
  ports: xxx

So in the future, this can be extended to:

service:
  ports:
    - 8000,9000 # Indicating there will be one LB for all IP + Port pairs, if Port == 8000 or 9000
    - 10000-10100,7888 # if port in [10000,10100] or port == 7888
    - 7000 # Indicating a LB for all IP:7000

Could you help make this API change? Also, it will be great to add a smoke test, update the related doc page and resolve the merge conflict. After that this should be ready to go!

Sure! Let me checkout to this version and add the tests

task.service.set_port(service_port)
else:
port_set = set()
for requested_resources in list(task.resources):
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lets assert service port is in every request resources;s ports field?

'Must only specify one port in resources. Each replica '
'will use the port specified as application ingress port.')
service_port_str = requested_resources.ports[0]
if not service_port_str.isdigit():
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

pls double check if we need this

assert task_resources.ports is not None
return task_resources.ports[0]
# Already checked all ports are valid in sky.serve.core.up
assert len(task.resources) >= 1, task
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
assert len(task.resources) >= 1, task
assert task.resources, task

@@ -25,6 +25,7 @@ def __init__(
readiness_timeout_seconds: int,
min_replicas: int,
max_replicas: Optional[int] = None,
port: Optional[int] = None,
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
port: Optional[int] = None,
ports: Optional[str] = None,

change name to ports for future compatibiility?

@@ -388,6 +388,9 @@ def get_service_schema():
},
}
},
'port': {
'type': 'integer',
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
'type': 'integer',
'type': 'str',

'echo "Accessing endpoints $endpoint, $endpoint_alt"; '
'curl $endpoint | grep "Hi, SkyPilot here"; '
'curl $endpoint_alt | grep "Hi, SkyPilot here"',
],
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lets also curl the load balancer endpoint?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants