cognitivecomputations/runpod-vllm

RunPod vLLM worker

Optional - build base image locally

docker build -t ehartford/runpod-worker-vllm:latest .

Bake model into image

docker build -t ehartford/runpod-worker-vllm:ondemand \
  --build-arg MODEL_NAME="cognitivecomputations/dolphin-2.6-mixtral-8x7b" \
  --build-arg MODEL_BASE_PATH="/model/" \
  -f Dockerfile-ondemand .
docker push ehartford/runpod-worker-vllm:ondemand
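
To bake a different model, the image tag and the two build arguments are the only values that need to change. The wrapper below is a minimal sketch, not part of this repository: the `bake_cmd` helper and the `IMAGE` and `RUN` environment variables are hypothetical names introduced for illustration, while the defaults are copied from the commands above. It prints the commands by default and only runs them when `RUN=1` is set.

```shell
#!/bin/sh
set -eu

# Defaults copied from the README commands; override via the environment.
# IMAGE and RUN are hypothetical variable names, not something the repo defines.
IMAGE="${IMAGE:-ehartford/runpod-worker-vllm:ondemand}"
MODEL_NAME="${MODEL_NAME:-cognitivecomputations/dolphin-2.6-mixtral-8x7b}"
MODEL_BASE_PATH="${MODEL_BASE_PATH:-/model/}"

# Hypothetical helper: print the docker build command for the bake step.
bake_cmd() {
  printf 'docker build -t %s --build-arg MODEL_NAME="%s" --build-arg MODEL_BASE_PATH="%s" -f Dockerfile-ondemand .\n' \
    "$IMAGE" "$MODEL_NAME" "$MODEL_BASE_PATH"
}

if [ "${RUN:-0}" = "1" ]; then
  # Real run: build the image with the model baked in, then push it.
  docker build -t "$IMAGE" \
    --build-arg MODEL_NAME="$MODEL_NAME" \
    --build-arg MODEL_BASE_PATH="$MODEL_BASE_PATH" \
    -f Dockerfile-ondemand .
  docker push "$IMAGE"
else
  # Dry run: show what would be executed without calling docker.
  bake_cmd
  echo "docker push $IMAGE"
fi
```

Saved under a name of your choosing and run from the repository root, the script prints the two commands for review; running it again with `RUN=1` and, for example, a different `MODEL_NAME` performs the build and push.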
