lastmile-ai · andrew-lastmile · Feb 21, 2025 · Feb 21, 2025
diff --git a/website/docs/autoeval/experiments.mdx b/website/docs/autoeval/experiments.mdx
@@ -24,6 +24,16 @@ Anything that influences the LLM application's performance is measureable throug
 - Adjusting **system prompts** for an agent  
 - And more
 
+We recommend tracking the following metadata for each experiment on your LLM application:
+|           | model_version        | chunk_size | chunk_strategy | top_k_retrieval | dataset            | temperature |
+| --------- | -------------------- | ---------  | -------------- | --------------- | ------------------ | ----------- |
+| Example 1 | gpt-4o-2024-08-06    | 1024       | sliding window | 5               | question-answer-v1 | 0           |
+| Example 2 | gemini-2.0-flash-001 | 512        | no overlap     | 5               | question-answer-v1 | 0           |
+
+:::info
+When saving model metadata, record the full version string, including any date or revision suffix (for example, `gpt-4o-2024-08-06` rather than `gpt-4o`). Model providers regularly update the model behind an unversioned name, so it is good practice to pin the exact version that works best for you.
+:::
+
 ## Usage Guide
 This guide walks through the process of setting up and running experiments using AutoEval, including:
 1. **Setting up the API key** and **creating a project**