
server : add some missing env variables #9116

Merged
merged 3 commits into ggerganov:master on Aug 27, 2024

Conversation

ngxson (Collaborator) commented Aug 21, 2024

Continuation of #9105

I forgot LLAMA_ARG_HOST and LLAMA_ARG_PORT.

As a nice-to-have, LLAMA_ARG_HF_REPO and LLAMA_ARG_MODEL_URL are also added. Although they are not used by the HF inference endpoint, they will be useful if someone wants to deploy llama.cpp to stateless/serverless platforms like Heroku or Google Cloud Run.
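For illustration, here is a minimal sketch of an env-only launch on such a platform; the values and the model URL are assumptions for this example, not something the PR prescribes:

```sh
# Sketch: configure llama-server entirely through environment variables,
# the way a stateless platform would inject config. Values are illustrative.
export LLAMA_ARG_HOST=0.0.0.0       # listen on all interfaces in the container
export LLAMA_ARG_PORT=8080          # e.g. mirror the port the platform assigns
export LLAMA_ARG_MODEL_URL=https://example.com/model.gguf  # hypothetical model URL
./llama-server                      # no CLI flags needed; config comes from env
```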


@ngxson ngxson requested a review from ggerganov August 21, 2024 09:57
@github-actions github-actions bot added the examples, devops (improvements to build systems and github actions), and server labels Aug 21, 2024
Nexesenex (Contributor)

This overall feature is very useful!
Would it be possible to add params.rope_scaling_type and the other rope-related parameters?

ngxson (Collaborator, Author) commented Aug 24, 2024

@Nexesenex Currently we can't pass an enum as an environment variable, so for now I can't add rope_scaling_type.

The hacky solution is to duplicate the parsing code from gpt_params_find_arg, but I don't feel it's worth doing. There will probably be a follow-up refactoring PR in the future to bring more variables to the env.
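For anyone attempting the workaround in the meantime, a rough sketch of what that duplicated mapping could look like; the variable name `LLAMA_ARG_ROPE_SCALING_TYPE` and the simplified enum are assumptions for illustration (the real definitions live in llama.h and common/common.cpp):

```cpp
#include <cstdlib>
#include <string>

// Simplified stand-in for the rope scaling enum (illustrative, not the real one).
enum rope_scaling : int { ROPE_NONE, ROPE_LINEAR, ROPE_YARN };

// Read a hypothetical env variable and duplicate the string -> enum mapping
// that gpt_params_find_arg performs for the corresponding CLI flag.
static void rope_scaling_from_env(rope_scaling & out) {
    const char * val = std::getenv("LLAMA_ARG_ROPE_SCALING_TYPE"); // hypothetical name
    if (!val) {
        return; // keep the default when the variable is unset
    }
    const std::string s(val);
    if      (s == "none")   out = ROPE_NONE;
    else if (s == "linear") out = ROPE_LINEAR;
    else if (s == "yarn")   out = ROPE_YARN;
}
```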

Nexesenex (Contributor)

@ngxson I tried and ran into the same problem, hence my request.
Thanks for the hacky hint! I will try to implement it myself for the time being.

@ngxson ngxson merged commit a77feb5 into ggerganov:master Aug 27, 2024
52 checks passed
dsx1986 pushed a commit to dsx1986/llama.cpp that referenced this pull request Oct 29, 2024
* server : add some missing env variables

* add LLAMA_ARG_HOST to server dockerfile

* also add LLAMA_ARG_CONT_BATCHING
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Nov 15, 2024
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Nov 18, 2024
Labels: devops (improvements to build systems and github actions), examples, server