Looks good
@gibbs-cullen Thank you very much for your efforts!!
Please see a few comments and suggestions.
I guess most of the comments probably go back to @miararoy's original phrasing, but some of the new additions might also be a bit inaccurate or misleading.
README.md (Outdated)
* **Easy to implement:** Bring your text data in Parquet or JSONL format, and Canopy will handle the rest. Canopy is currently compatible with any OpenAI API endpoint.
* **Reliable at scale:** Build fast, highly accurate GenAI applications that are production-ready and backed by Pinecone’s vector database. Seamlessly scale to billions of items with transparent, resource-based pricing.
* **Open and flexible:** Fully open-source, Canopy is both modular and extensible. Deploy as a service or a library, and choose the components you need. Easily incorporate it into existing OpenAI applications and connect Canopy to your preferred UI.
* **Interactive and iterative:** Chat with your text data using a simple command in the Canopy CLI. Easily compare RAG vs. non-RAG workflows side-by-side to interactively evaluate the augmented results before scaling to production.
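As a concrete illustration of the "bring your data in Parquet or JSONL format" bullet, preparing an input file might look like the sketch below. The id/text/source/metadata record schema is an assumption based on the docs; verify it against the actual expected data format.

```python
# A minimal sketch of preparing text data for Canopy ingestion.
# Assumption: records carry id/text/source/metadata fields, as described
# in the Canopy docs - verify against the actual expected schema.
import pandas as pd

df = pd.DataFrame([
    {
        "id": "doc-1",
        "text": "Pinecone is a managed vector database for semantic search.",
        "source": "https://example.com/doc-1",  # hypothetical source URL
        "metadata": {"topic": "databases"},
    },
])

# JSONL (one JSON record per line); Parquet works similarly via to_parquet().
df.to_json("data.jsonl", orient="records", lines=True)
```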
I think we should emphasize in our phrasing that, as a development tool, you can use the CLI to experiment with the chat service.
We got feedback from our internal reviewer that they understood Canopy to be a CLI tool, when in fact it isn't (it's a chatbot backend).
+1. We should frame the evaluative element more as "Interactive Evaluation": evaluate your RAG workflow with a CLI-based chat debugging tool.
By enhancing language models with access to unlearned knowledge and infinite memory, we can build AI applications that can answer questions and assist humans without the risk of hallucinating or generating fake content. Let's learn how Canopy executes the RAG pipeline.
Learn how Canopy implements the full RAG workflow to prevent hallucinations and augment your LLM (via an OpenAI endpoint) with your own text data.
![](.readme-content/rag_flow.png)
Personally, I still believe this drawing is too complex for our front page. We should make a much simpler one, highlighting the key features, and save the actual detailed flow to the "advanced" section.
Agreed; we still need to update the diagram.
* **ChatEngine** _`/chat/completions`_ - a complete RAG unit that exposes a chat interface of an LLM augmented with a retrieval engine.
* **ContextEngine** _`/context/query`_ - a proxy between your application and Pinecone. It handles the "R" in the RAG pipeline and returns the snippet of context along with the respective source.
* **KnowledgeBase** _`/context/{upsert, delete}`_ - the data management interface. It handles the processing, chunking, and encoding (embedding) of the data, along with Upsert and Delete operations.
1. **Canopy Core Library** - Canopy has 3 API-level components that are responsible for different parts of the RAG workflow:
We're intermixing classes and APIs here, while the latter doesn't belong. The core library has Python classes like `ContextEngine`. The Canopy Server has API endpoints, like `/context/query`. These don't belong in the section describing the core library.
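To make the class-vs-endpoint split concrete, wiring the core library classes together might look roughly like the sketch below. Import paths and constructor arguments are assumptions based on the library docs; see docs/library.md for the real API.

```python
# A rough sketch of composing the core library classes (not the REST API).
# Import paths and signatures are assumptions - see docs/library.md.
from canopy.tokenizer import Tokenizer
from canopy.knowledge_base import KnowledgeBase
from canopy.context_engine import ContextEngine
from canopy.chat_engine import ChatEngine

Tokenizer.initialize()                     # one-time global tokenizer setup

kb = KnowledgeBase(index_name="my-index")  # hypothetical index name
kb.connect()                               # connect to the Pinecone index

context_engine = ContextEngine(kb)         # the "R" (retrieval) in RAG
chat_engine = ChatEngine(context_engine)   # full RAG chat on top of retrieval
```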
Noted, we can separate those.
So for the server, you can run any of these "classes" via the API endpoints?
I'll make some changes and commit them to this branch, if you don't mind.
README.md (Outdated)
* **ChatEngine** _`/chat/completions`_ - implements the full RAG workflow and exposes a chat interface to interact with your data. It acts as a wrapper around the Knowledge Base and Context Engine.
* **ContextEngine** _`/context/query`_ - performs the “retrieval” part of RAG. It rewrites and transforms your queries into query embeddings before finding the most relevant results (including citations) from Pinecone to pass along to your LLM prompt (via an OpenAI endpoint).
* **KnowledgeBase** _`/context/{upsert, delete}`_ - prepares your data for the RAG workflow. It automatically chunks and transforms your text data into text embeddings before upserting them into the Pinecone vector database. It also handles Delete operations.

> More information about the Core Library usage can be found in the [Library Documentation](docs/library.md)

2. **Canopy Service** - a web service that wraps the **Canopy Core** and exposes it as a REST API. The service is built on top of FastAPI, Uvicorn and Gunicorn and can be easily deployed in production.
I think we should adopt the term "Canopy Server" rather than "Canopy Service".
That's also how it's called in the code.
@miararoy we should probably change the CLI prints accordingly
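For reference, a call against the server's REST API might look like the sketch below, assuming a server running locally on port 8000 and the routes as written in the diff above; the actual host, port, and route prefix may differ in your deployment.

```python
# A sketch of calling the Canopy server's chat endpoint.
# Host, port, and route prefix are assumptions for a local deployment.
import requests

resp = requests.post(
    "http://localhost:8000/chat/completions",
    json={"messages": [{"role": "user", "content": "What is Canopy?"}]},
)
print(resp.json())  # OpenAI-style chat completion response
```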
## Considerations

* Canopy is currently only compatible with OpenAI API endpoints for both the embedding model and the LLM. Rate limits and pricing set by OpenAI will apply.

## Setup
That's further down the README - but I think we need to put a hard stop between steps 3 and 4 of the "Quick start".
After step 3, you have a functioning, ready-made Canopy server. We should stop there and say something like "your server is ready to be deployed as a chatbot backend!".
Then "Chat with your data" should become an optional step (not part of the actual Quickstart), saying something like, "to immediately explore and experiment with your server, you can use the built-in Chat tool".
Otherwise, again, the README might create the impression that Canopy itself is a CLI tool, meant to be used from the CLI (which is counter-productive).
Yes, I'm going to leave the quickstart part to you plus Byron.
Yes - I agree. I think step 3 is when your Canopy server is ready to go. Left this feedback in my notes as well.
The following section can explain how to evaluate; it's not a step 4 in getting started, per se.
Looks great, nothing to add on @igiloh-pinecone's comments.
I approve.
* **ChatEngine** _`/chat/completions`_ - implements the full RAG workflow and exposes a chat interface to interact with your data. It acts as a wrapper around the Knowledge Base and Context Engine.
* **ContextEngine** _`/context/query`_ - performs the “retrieval” part of RAG. It rewrites and transforms your queries into query embeddings before finding the most relevant results (including citations) from Pinecone to pass along to your LLM prompt (via an OpenAI endpoint).
* **KnowledgeBase** _`/context/{upsert, delete}`_ - prepares your data for the RAG workflow. It automatically chunks and transforms your text data into text embeddings before upserting them into the Pinecone vector database. It also handles Delete operations.
The `KnowledgeBase` is also responsible for part of the retrieval. Given a textual query, the `KnowledgeBase` is responsible for retrieving the most relevant document chunks. Then the `ContextEngine` is responsible for aggregating all this retrieved information into one coherent textual context.
(I'm not sure if we should mention that here; that might be too nuanced. But it is also inaccurate to present the `KnowledgeBase` as only doing `upsert`, since it's a crucial part of the `query` process as well.)
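Continuing the earlier library sketch (reusing its `kb` and `context_engine` objects), the retrieval split being described might look like this; the `Query` import and both method signatures are assumptions, so check the docs before relying on them.

```python
# A sketch of the retrieval split: KnowledgeBase returns relevant chunks,
# and ContextEngine aggregates them into one coherent textual context.
# The Query import and both signatures are assumptions - check the docs.
from canopy.models.data_models import Query

queries = [Query(text="What is a vector database?")]

chunks = kb.query(queries)  # most relevant document chunks
context = context_engine.query(queries, max_context_tokens=512)
```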
We can add more of those details here; these blurbs were from the blog, so they're more high-level.
So the chat engine also transforms queries into query embeddings?
Applied the suggestions I made.
Branch updated from 033c646 to 35c437c.