Replicate is a platform for running machine learning models in the cloud from your own code. Browse models.
For detailed usage, setup, and more, here is the official Replicate Documentation for Node.js.
When prompting Llama, user inputs are wrapped with [INST]
and [/INST]
tags to signal the beginning and end of user input. This formatting helps Llama distinguish between its responses and user messages.
System prompts are used to guide or constrain the model's behavior.
For more tips and a comprehensive guide on how to prompt Llama, read this guide to prompting Llama 2.