This is an example Node.js application that uses embeddings and the LLaMA model for text retrieval and response generation. It processes a text corpus, generates embeddings for text "chunks", and uses these embeddings to perform a "similarity search" in response to queries. The system consists of a Node.js server that handles API requests and a p5.js sketch for client interaction.
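The "similarity search" here is a straightforward nearest-neighbor lookup over the precomputed vectors. Below is a minimal sketch of that idea; the `{ text, embedding }` shape for entries in `embeddings.json` is an assumption, so adjust it to match whatever `save-embeddings.js` actually writes.

```js
// Minimal sketch of the similarity search: rank stored chunks by cosine
// similarity to a query embedding. Assumes embeddings.json is an array of
// { text, embedding } objects (adjust to however save-embeddings.js writes it).
import fs from 'fs';

function cosineSimilarity(a, b) {
  let dot = 0, magA = 0, magB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    magA += a[i] * a[i];
    magB += b[i] * b[i];
  }
  return dot / (Math.sqrt(magA) * Math.sqrt(magB));
}

function findNearestChunks(queryEmbedding, chunks, topK = 3) {
  return chunks
    .map((chunk) => ({
      text: chunk.text,
      similarity: cosineSimilarity(queryEmbedding, chunk.embedding),
    }))
    .sort((a, b) => b.similarity - a.similarity)
    .slice(0, topK);
}

// The query embedding would come from the same embedding model used in save-embeddings.js.
const chunks = JSON.parse(fs.readFileSync('embeddings.json', 'utf-8'));
```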
- `server.js`: Server file that handles API requests and integrates with the Replicate API.
- `save-embeddings.js`: Processes a text file and generates embeddings.
- `test-embeddings.js`: Tests the embeddings search functionality without all that client/server stuff.
- `embeddings.json`: Precomputed embeddings generated from the text corpus.
- `public/`: p5.js sketch.
- `.env`: Replicate API token.
- Using open-source models for faster and cheaper text embeddings
- How to use retrieval augmented generation
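Retrieval-augmented generation, as used here, amounts to prompt assembly: the top-matching chunks from the similarity search are pasted into the prompt ahead of the user's question before the LLaMA model is called via Replicate. A rough sketch of that step follows; the model identifier and prompt format are placeholders, not necessarily what `server.js` uses.

```js
// Sketch of the RAG step: stuff the retrieved chunks into a prompt, then ask LLaMA.
// The model name and prompt format are placeholders; check server.js for the real ones.
import Replicate from 'replicate';

const replicate = new Replicate({ auth: process.env.REPLICATE_API_TOKEN });

async function answerWithContext(question, topChunks) {
  const context = topChunks.map((chunk) => chunk.text).join('\n\n');
  const prompt = `Use the following context to answer the question.\n\nContext:\n${context}\n\nQuestion: ${question}\nAnswer:`;
  const output = await replicate.run('meta/meta-llama-3-8b-instruct', {
    input: { prompt },
  });
  // Replicate returns language model output as an array of string tokens.
  return output.join('');
}
```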
- Install dependencies:

```
npm install
```
- Set up the `.env` file with your Replicate API token:

```
REPLICATE_API_TOKEN=your_api_token_here
```
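The token is only ever read from the environment by the scripts. For reference, loading it might look like the sketch below, which assumes the dotenv package (check `server.js` for how the project actually loads it).

```js
// Minimal sketch of reading the token from .env (assumes the dotenv package).
import 'dotenv/config';

const token = process.env.REPLICATE_API_TOKEN;
if (!token) {
  throw new Error('Missing REPLICATE_API_TOKEN, check your .env file');
}
```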
- Generate the `embeddings.json` file by running `save-embeddings.js`. (You'll need to hard-code a text filename and adjust how the text is split up depending on the format of your data.)

```js
const raw = fs.readFileSync('text-corpus.txt', 'utf-8');
let chunks = raw.split(/\n+/);
```

```
node save-embeddings.js
```
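For orientation, the overall shape of `save-embeddings.js` is: read the corpus, split it into chunks, embed each chunk through a Replicate-hosted embedding model, and write everything out as `embeddings.json`. The sketch below assumes a particular embedding model and input/output shape; match them to whatever the script actually calls.

```js
// Rough sketch of generating and saving embeddings. The embedding model name and
// its input/output shape are assumptions - adapt them to what save-embeddings.js uses.
import fs from 'fs';
import Replicate from 'replicate';

const replicate = new Replicate({ auth: process.env.REPLICATE_API_TOKEN });

const raw = fs.readFileSync('text-corpus.txt', 'utf-8');
const chunks = raw.split(/\n+/).filter((chunk) => chunk.trim().length > 0);

const embeddings = [];
for (const text of chunks) {
  // Placeholder model identifier; the output is assumed to be the embedding vector.
  const output = await replicate.run('replicate/all-mpnet-base-v2', {
    input: { text },
  });
  embeddings.push({ text, embedding: output });
}

fs.writeFileSync('embeddings.json', JSON.stringify(embeddings));
console.log(`Saved ${embeddings.length} embeddings to embeddings.json`);
```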
- Run the server:

```
node server.js
```

Open a browser to http://localhost:3000 (or whatever port is specified).
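The p5.js sketch in `public/` is what actually talks to the server, but if you want to hit the API directly from the browser console, a request might look roughly like this. The endpoint path and payload shape are guesses; check `server.js` and the sketch for the real ones.

```js
// Hypothetical client-side query; the /api/query route and { question } payload
// are assumptions, not necessarily what server.js defines.
async function askQuestion(question) {
  const response = await fetch('/api/query', {
    method: 'POST',
    headers: { 'Content-Type': 'application/json' },
    body: JSON.stringify({ question }),
  });
  const data = await response.json();
  console.log(data);
  return data;
}
```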