Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

llama cpp server cant open to public #6268

Open
Kev1ntan opened this issue Mar 24, 2024 · 3 comments
Open

llama cpp server cant open to public #6268

Kev1ntan opened this issue Mar 24, 2024 · 3 comments
Labels
enhancement New feature or request good first issue Good for newcomers server/webui

Comments

@Kev1ntan
Copy link

Kev1ntan commented Mar 24, 2024

Darwin Feedloops-Mac-Studio.local 23.2.0 Darwin Kernel Version 23.2.0: Wed Nov 15 21:55:06 PST 2023; root:xnu-10002.61.3~2/RELEASE_ARM64_T6020 arm64

example my public ip is: http://36.54.42.112

step to reproduce:

  1. python -m http.server --bind 0.0.0.0 8082, can be access from localhost:8082 and http://36.54.42.112:8082
  2. ./server -m ../models/mistral-7b-openorca.Q8_0.gguf -c 2048 --host 0.0.0.0 --port 8082 -ngl 33 -cb -np 32
    can be access from localhost:8082/v1/models but cant access from http://36.54.42.112:8082/v1/models

any insight?, thank you.

@phymbert
Copy link
Collaborator

HI,

please verify the network family it listens to, ipv4 or ipv6 ?

We had the issue on the server test. Probably need to add a flag to select ipv4 only.

addrs = socket.getaddrinfo(context.server_fqdn, context.server_port, type=socket.SOCK_STREAM)

@phymbert
Copy link
Collaborator

We usually do not expose directly the server to internet, I am using docker or kubernetes and the container has only one socket familty to listen to.
Feel free to open a PR to configure the good socket flags.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request good first issue Good for newcomers server/webui
Projects
None yet
Development

No branches or pull requests

2 participants