Minimal example of running LLama with Kubernetes

Uses the ollama/ollama image from Docker Hub.

Prerequisites

You can find an example using kind here: Kubernetes and You

helm upgrade --install ollama ./ollama -f ./ollama/values.yaml -n ollama

pip install -r example-app/requirements.txt
python ./example-app/app.py

Visit http://localhost:8999 in your browser to receive a motivational llama message.

An example real time RAG pipeline is provided. We use Redis as a vector database and document cache.

Assuming you have the provided kind cluster running locally. You can use the following to install a redis Helm chart:

helm -n redis install redis oci://registry-1.docker.io/bitnamicharts/redis --create-namespace