Skip to content

Commit

Permalink
Merge branch 'main' into prompt-test
Browse files Browse the repository at this point in the history
  • Loading branch information
prashantgupta24 authored Aug 20, 2024
2 parents d6e6f1c + 19adb9d commit 3438634
Show file tree
Hide file tree
Showing 2 changed files with 6 additions and 1 deletion.
6 changes: 5 additions & 1 deletion Dockerfile.ubi
Original file line number Diff line number Diff line change
Expand Up @@ -187,6 +187,10 @@ RUN --mount=type=cache,target=/root/.cache/pip \
ENV HF_HUB_OFFLINE=1 \
PORT=8000 \
HOME=/home/vllm \
# Allow requested max length to exceed what is extracted from the
# config.json
# see: https://github.com/vllm-project/vllm/pull/7080
VLLM_ALLOW_LONG_MAX_MODEL_LEN=1 \
VLLM_USAGE_SOURCE=production-docker-image \
VLLM_WORKER_MULTIPROC_METHOD=fork

Expand All @@ -210,4 +214,4 @@ RUN --mount=type=cache,target=/root/.cache/pip \

ENV GRPC_PORT=8033
USER 2000
ENTRYPOINT ["python3", "-m", "vllm_tgis_adapter"]
ENTRYPOINT ["python3", "-m", "vllm_tgis_adapter", "--uvicorn-log-level=warning"]
1 change: 1 addition & 0 deletions extras/smoke-test.sh
Original file line number Diff line number Diff line change
Expand Up @@ -22,6 +22,7 @@ function wait_for(){
max_retries=$((max_retries-1))
if [[ max_retries -le 0 ]]; then
echo "Timed out waiting for $name server" >&2
kill -9 ${server_pid}
exit 1
fi
done
Expand Down

0 comments on commit 3438634

Please sign in to comment.