
Merge pull request #27 from ritual-net/sr/feat/remove_infernet_ml
Remove infernet_ml
arshan-ritual authored Oct 28, 2024
2 parents 9b69f49 + 3a30ceb commit 5bdf085
Showing 58 changed files with 578 additions and 2,486 deletions.
6 changes: 6 additions & 0 deletions .gitignore
@@ -45,3 +45,9 @@ remote_sync

# secrets
*-key.json

# Virtual envs
**/env/**

# Arweave keyfile
keyfile-*.json
6 changes: 6 additions & 0 deletions CHANGELOG.md
@@ -5,6 +5,12 @@ All notable changes to this project will be documented in this file.
- ##### The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/).
- ##### This project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).

## [2.0.0] - 2024-10-28

### Changed
- Simplified examples to the minimum core functionality necessary and removed all dependencies on `infernet-ml`.
- Updated images used for deploying the Infernet Node.

## [1.0.2] - 2024-07-31

### Changed
2 changes: 1 addition & 1 deletion Makefile
@@ -7,7 +7,7 @@ build-container:

remove-containers:
docker compose -f deploy/docker-compose.yaml down || true
docker stop $(project) anvil-node && docker rm $(project) anvil-node || true
docker stop $(project) infernet-anvil && docker rm $(project) infernet-anvil || true

build-multiplatform:
$(MAKE) -C ./projects/$(project)/container build-multiplatform
2 changes: 1 addition & 1 deletion README.md
@@ -14,6 +14,6 @@ model to infernet. Using this example will make it easier for you to deploy your
4. [Prompt to NFT](projects/prompt-to-nft/prompt-to-nft.md): In this example, we use [stablediffusion](https://github.com/Stability-AI/stablediffusion) to
mint NFTs on-chain using a prompt.
5. [TGI Inference with Mistral-7b](projects/tgi-llm/tgi-llm.md): This example shows you how to deploy an arbitrary
LLM model using [Huggingface's TGI](https://huggingface.co/docs/text-generation-inference/en/index), and use it with an infernet node.
LLM model using [Huggingface's TGI](https://huggingface.co/docs/text-generation-inference/en/index), and use it with an Infernet Node.
6. [Running OpenAI's GPT-4 on Infernet](projects/gpt4/gpt4.md): This example shows you how to deploy OpenAI's GPT-4 model
to infernet.
4 changes: 3 additions & 1 deletion deploy/docker-compose.yaml
@@ -1,6 +1,6 @@
services:
node:
image: ritualnetwork/infernet-node:1.0.0
image: ritualnetwork/infernet-node:1.3.1
ports:
- "0.0.0.0:4000:4000"
volumes:
@@ -31,6 +31,7 @@ services:
- redis-data:/data
restart:
on-failure
container_name: infernet-redis

fluentbit:
image: fluent/fluent-bit:3.1.4
@@ -45,6 +46,7 @@
- network
restart:
on-failure
container_name: infernet-fluentbit

infernet-anvil:
image: ritualnetwork/infernet-anvil:1.0.0
2 changes: 1 addition & 1 deletion projects/gpt4/container/Dockerfile
@@ -18,7 +18,7 @@ ADD https://astral.sh/uv/install.sh /install.sh
RUN chmod 755 /install.sh
RUN /install.sh && rm /install.sh

COPY src/requirements.txt .
COPY requirements.txt .

RUN /root/.cargo/bin/uv pip install --system --no-cache -r requirements.txt

32 changes: 27 additions & 5 deletions projects/gpt4/container/README.md
@@ -1,20 +1,42 @@
# GPT 4
In this example, we run a minimalist container that makes use of our closed-source model
workflow: `CSSInferenceWorkflow`. Refer to [src/app.py](src/app.py) for the
implementation of the quart application.

In this example, we will run a minimalist container that makes use of the OpenAI [completions API](https://platform.openai.com/docs/api-reference/chat) to serve text generation requests.

Check out the full tutorial [here](https://learn.ritual.net/examples/running_gpt_4).

## Requirements
To use the model you'll need to have an OpenAI api key. Get one at
[OpenAI](https://openai.com/)'s website.

To use the model you'll need to have an OpenAI API key. Get one on [OpenAI](https://openai.com/)'s website.

## Build the Container

Simply run the following command to build the container:

```bash
make build
```

## Run the Container

To run the container, you can use the following command:

```bash
make run
```

## Test the Container

You can test the container by making inference requests directly via your terminal:

```bash
curl -X POST localhost:3000/service_output -H "Content-Type: application/json" \
-d '{"source": 1, "data": {"prompt": "can shrimps actually fry rice?"}}'
```
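
For off-chain destinations the service replies with a JSON object of the form `{"message": ...}` (see the off-chain branch in [src/app.py](src/app.py)). As a minimal sketch, assuming the container is running locally on port 3000 as in the curl example above, the same request in Python looks like this (the completion text itself will vary):

```python
import requests

# Off-chain request (source=1) to the locally running container,
# mirroring the curl example above.
response = requests.post(
    "http://localhost:3000/service_output",
    json={"source": 1, "data": {"prompt": "can shrimps actually fry rice?"}},
)
response.raise_for_status()

# For off-chain destinations the service returns {"message": "<completion text>"}.
print(response.json()["message"])
```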

## Next steps

This container is for demonstration purposes only, and is purposefully simplified for
readability and ease of comprehension. For a production-ready version of this code, check out:

- The [CSS Inference Workflow](https://infernet-ml.docs.ritual.net/reference/infernet_ml/workflows/inference/css_inference_workflow/): A Python class that supports multiple API providers, including OpenAI, and can be used to build production-ready containers.
- The [CSS Inference Service](https://infernet-services.docs.ritual.net/reference/css_inference_service/): A production-ready, [Infernet](https://docs.ritual.net/infernet/node/introduction)-compatible container that works out-of-the-box with minimal configuration, and serves inference using the `CSS Inference Workflow`.
8 changes: 5 additions & 3 deletions projects/gpt4/container/config.sample.json
@@ -18,8 +18,10 @@
"allowed_sim_errors": []
},
"snapshot_sync": {
"sleep": 3,
"batch_size": 100
"sleep": 1.5,
"batch_size": 50,
"starting_sub_id": 0,
"sync_period": 1
}
},
"startup_wait": 1.0,
@@ -43,7 +45,7 @@
"allowed_ips": [],
"command": "--bind=0.0.0.0:3000 --workers=2",
"env": {
"OPENAI_API_KEY": "your-key"
"OPENAI_API_KEY": "your-openai-key"
},
"volumes": [],
"accepted_payments": {},
2 changes: 2 additions & 0 deletions projects/gpt4/container/requirements.txt
@@ -0,0 +1,2 @@
quart==0.19.4
web3==6.15.0
120 changes: 56 additions & 64 deletions projects/gpt4/container/src/app.py
@@ -3,29 +3,15 @@
from typing import Any, cast

from eth_abi import decode, encode # type: ignore
from infernet_ml.utils.css_mux import (
ConvoMessage,
CSSCompletionParams,
CSSRequest,
Provider,
)
from infernet_ml.utils.service_models import InfernetInput
from infernet_ml.utils.service_models import JobLocation
from infernet_ml.workflows.inference.css_inference_workflow import CSSInferenceWorkflow
from quart import Quart, request
import requests

log = logging.getLogger(__name__)


def create_app() -> Quart:
app = Quart(__name__)

workflow = CSSInferenceWorkflow(
api_keys={Provider.OPENAI: os.environ["OPENAI_API_KEY"]}
)

workflow.setup()

@app.route("/")
def index() -> str:
"""
@@ -35,70 +21,76 @@ def index() -> str:

@app.route("/service_output", methods=["POST"])
async def inference() -> Any:
req_data = await request.get_json()
"""
InfernetInput has the format:
Input data has the format:
source: (0 on-chain, 1 off-chain)
destination: (0 on-chain, 1 off-chain)
data: dict[str, Any]
"""
infernet_input: InfernetInput = InfernetInput(**req_data)
req_data: dict[str, Any] = await request.get_json()
onchain_source = True if req_data.get("source") == 0 else False
onchain_destination = True if req_data.get("destination") == 0 else False
data = req_data.get("data")

match infernet_input:
case InfernetInput(source=JobLocation.OFFCHAIN):
prompt = cast(dict[str, Any], infernet_input.data).get("prompt")
case InfernetInput(source=JobLocation.ONCHAIN):
# On-chain requests are sent as a generalized hex-string which we will
# decode to the appropriate format.
(prompt,) = decode(
["string"], bytes.fromhex(cast(str, infernet_input.data))
)
case _:
raise ValueError("Invalid source")
if onchain_source:
"""
For on-chain requests, the prompt is sent as a generalized hex-string
which we will decode to the appropriate format.
"""
(prompt,) = decode(["string"], bytes.fromhex(cast(str, data)))
else:
"""For off-chain requests, the prompt is sent as is."""
prompt = cast(dict[str, Any], data).get("prompt")

result = workflow.inference(
CSSRequest(
provider=Provider.OPENAI,
endpoint="completions",
model="gpt-4-0613",
params=CSSCompletionParams(
messages=[
ConvoMessage(
role="system", content="you are a helpful " "assistant."
),
ConvoMessage(role="user", content=cast(str, prompt)),
]
),
)
# Make request to the OpenAI API to get a completion of the prompt.
# See https://platform.openai.com/docs/api-reference/chat for more info.
api_key = os.environ["OPENAI_API_KEY"]
response = requests.post(
"https://api.openai.com/v1/chat/completions",
headers={
"Content-Type": "application/json",
"Authorization": f"Bearer {api_key}",
},
json={
"model": "gpt-4-0613",
"messages": [
{"role": "system", "content": "you are a helpful assistant."},
{"role": "user", "content": cast(str, prompt)},
],
},
)

match infernet_input:
case InfernetInput(destination=JobLocation.OFFCHAIN):
"""
In case of an off-chain request, the result is returned as is.
"""
return {"message": result}
case InfernetInput(destination=JobLocation.ONCHAIN):
"""
In case of an on-chain request, the result is returned in the format:
# Ensure the request was successful, and get the result.
response.raise_for_status()
result = response.json()
content = result["choices"][0]["message"]["content"]

# Depending on the destination, the result is returned in a different format.
if onchain_destination:
"""
For on-chain requests, the result is returned in the format:
{
"raw_input": str,
"processed_input": str,
"raw_output": str,
"processed_output": str,
"proof": str,
}
refer to: https://docs.ritual.net/infernet/node/containers for more
info.
"""
return {
"raw_input": "",
"processed_input": "",
"raw_output": encode(["string"], [result]).hex(),
"processed_output": "",
"proof": "",
}
case _:
raise ValueError("Invalid destination")
refer to: https://docs.ritual.net/infernet/node/advanced/containers for more
info.
"""
return {
"raw_input": "",
"processed_input": "",
"raw_output": encode(["string"], [content]).hex(),
"processed_output": "",
"proof": "",
}
else:
"""
For off-chain requests, the result is returned as is.
"""
return {"message": content}

return app

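The on-chain path in the new `app.py` relies on ABI encoding via `eth_abi`: the prompt arrives as a hex-encoded `string`, and the completion is returned hex-encoded in the `raw_output` field. A minimal sketch of that round trip, using the same `encode`/`decode` calls as `src/app.py` and purely illustrative values:

```python
from eth_abi import decode, encode

# Encode a prompt the way the container expects on-chain input:
# a single ABI-encoded `string`, hex-encoded without a 0x prefix.
prompt_hex = encode(["string"], ["can shrimps actually fry rice?"]).hex()

# The container decodes it back to a Python string before calling OpenAI.
(prompt,) = decode(["string"], bytes.fromhex(prompt_hex))
assert prompt == "can shrimps actually fry rice?"

# The completion travels back the same way: ABI-encoded and hex-encoded
# in the `raw_output` field of the response.
raw_output = encode(["string"], ["Yes, with a hot enough wok."]).hex()
(completion,) = decode(["string"], bytes.fromhex(raw_output))
print(completion)
```
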
4 changes: 0 additions & 4 deletions projects/gpt4/container/src/requirements.txt

This file was deleted.

2 changes: 1 addition & 1 deletion projects/gpt4/contracts/README.md
@@ -2,7 +2,7 @@

This is a minimalist foundry project that implements a [callback consumer](https://docs.ritual.net/infernet/sdk/consumers/Callback)
that makes a prompt to the [container](../container/README.md), which then makes a call to OpenAI's GPT4. For an
end-to-end flow of how this works, follow the [guide here](../gpt4.md).
end-to-end flow of how this works, follow our [GPT4 tutorial](https://learn.ritual.net/examples/running_gpt_4).

## Deploying
