llproxy/spec/llproxy.md at main · the-crypt-keeper/llproxy #867
Labels
AI-Agents
Autonomous AI agents using LLMs
AI-Chatbots
Topics related to advanced chatbot platforms integrating multiple AI models
hosting-services
llm model hosting services
llm
Large Language Models
llm-applications
Topics related to practical applications of Large Language Models in various fields
llm-inference-engines
Software to run inference on large language models
software-engineering
Best practice for software engineering
source-code
Code snippets
llproxy/spec/llproxy.md at main · the-crypt-keeper/llproxy
Snippet
"Let's create a new NodeJS project called LLProxy.
The goal of this project will be a self-configuring proxy that discovers large language model servers and route requests to them in a fleixble way.
Configuration
A json configuration file will be provided in the following format:
Data Structures
Endpoints
All endpoints start in /v1 to be compliant with open-ai API specification.
The /v1/models endpoint will be special and handled by the proxy itself.
The /v1/completions and /v1/chat/completions endpoints will use the 'model' field of the request body to route the request (more on that later)
Model Discovery
Every 30 seconds, the following process should run in a background task and rebuild the active list of models:
tags
from this endpoint to create the final model names in the formname:tag
Emit appropriate debug information as the process runs and a summary when it is complete.
Model Endpoint
The /v1/models endpoint should return the current active list of models. Our 'id' is the
name:tag
final model name from above, but pass through all other model information from the endpoint itself.Completions endpoints
/v1/completions and /v1/chat/completions proxies should look at 'model' which will be the
name:tag
and use it to look-up in the active list of models the real endpoint and model id.It should overwrite the 'model' with the true model id and then proxy the request to the backend endpoint.
Note that HTTP response streaming is supported on both of these endpoints, so the proxy should be non-blocking and asyncronous."
Suggested labels
None
FragmentEditor
We are going to be creating a new React application called FragmentEditor.
The purpose of the application is to help a user write, edit and revise stories, documents, emails or any other text quickly with keyboard shortcuts and AI assistance.
Use create-react-app to generate the scaffolding, then provide all the required changes to the source code to implement the requirements below.
Concepts
Fragment: A small piece of text, usually but not necessarily a sentence
Fragment list: A list of fragments that when joined together forms the current document.
Selected fragment: The index of the currently selected fragment in the list. It should be possible to select a new, not-yet-created fragment at the very end of the list.
Document: The fragment list joined into a single string without introducing any additional whitespace.
Mode: The current editor mode: "explore" (default), "edit" or "insert"
The Fragment List
The main display area, the fragment list, should take up 95% of the screen in both width and height the rest should be padding.
Inside this area, render the current list of fragments. Each fragment should be rendered visually distinctly with the selected fragment highlighted using background color. Remember fragments may contain newlines, but should otherwise render tightly with adjacent fragments.
It should possible to select a new, not-yet-created fragment at the very end of the list which should render as
<new>
with a visually distinct background.Keyboard Controls
In the default "explore" mode, the following keyboard controls should be available:
In "edit" and "insert" modes, the Selected Fragment should be rendered as a textarea. If entered on the
<new>
fragment, the text should start empty and if the user accepts it with enter the fragment should be created and the selected fragment again moved to another<new>
.The following keyboard controls should be available in "edit" and "insert" mode:
Starting state
We should start in "explore" mode.
The fragment list should be empty.
The
<new>
virtual fragment at the end of the list should be selected.The text was updated successfully, but these errors were encountered: