From 6397b6ed0711f2ca07bc553341f6cc66c7e3b551 Mon Sep 17 00:00:00 2001 From: Courtney Pacheco <6019922+courtneypacheco@users.noreply.github.com> Date: Wed, 11 Dec 2024 11:33:03 -0500 Subject: [PATCH] Update URLs to point to the new URLs for `test-data` files The repo structure was modified in `instructlab/instructlab`, so we need to update the URLs to coincide with that change. Also, we need to add `mbta` to the spellcheck dictionary because the PR build checks think that `mbta` is a misspelled word. Signed-off-by: Courtney Pacheco <6019922+courtneypacheco@users.noreply.github.com> --- .spellcheck-en-custom.txt | 3 ++- docs/dataset_formats.md | 14 +++++++++++--- 2 files changed, 13 insertions(+), 4 deletions(-) diff --git a/.spellcheck-en-custom.txt b/.spellcheck-en-custom.txt index eb773134..1f9c3ab9 100644 --- a/.spellcheck-en-custom.txt +++ b/.spellcheck-en-custom.txt @@ -5,8 +5,8 @@ Backport backported CLI codebase -composable Composable +composable config configs customizable @@ -29,6 +29,7 @@ JSON Langchain's LLM LLMBlock +mbta MCQ Merlinite Mixtral diff --git a/docs/dataset_formats.md b/docs/dataset_formats.md index 0d6ae7cf..ea1bbbbd 100644 --- a/docs/dataset_formats.md +++ b/docs/dataset_formats.md @@ -48,9 +48,17 @@ The fields in a knowledge contribution are: Examples of these files used in CI are found [here](https://github.com/instructlab/instructlab/tree/main/scripts/test-data): -* [e2e-qna-freeform-skill](https://github.com/instructlab/instructlab/blob/main/scripts/test-data/e2e-qna-freeform-skill.yaml) -* [e2e-qna-grounded-skill](https://github.com/instructlab/instructlab/blob/main/scripts/test-data/e2e-qna-grounded-skill.yaml) -* [e2e-qna-knowledge](https://github.com/instructlab/instructlab/blob/main/scripts/test-data/e2e-qna-knowledge.yaml) +* Compositional skills + * Freeform + * [e2e-qna-freeform-palindrome-skill](https://github.com/instructlab/instructlab/blob/main/scripts/test-data/compositional_skills/freeform/e2e-qna-freeform-palindrome-skill.yaml) + * [e2e-qna-freeform-siblings-skill](https://github.com/instructlab/instructlab/blob/main/scripts/test-data/compositional_skills/freeform/e2e-qna-freeform-siblings-skill.yaml) + * Grounded + * [e2e-qna-grounded-employee-skill](https://github.com/instructlab/instructlab/blob/main/scripts/test-data/compositional_skills/grounded/e2e-qna-grounded-employee-skill.yaml) + * [e2e-qna-grounded-punctuation-skill](https://github.com/instructlab/instructlab/blob/main/scripts/test-data/compositional_skills/grounded/e2e-qna-grounded-punctuation.yaml) +* Knowledge + * [e2e-qna-knowledge-mbta](https://github.com/instructlab/instructlab/blob/main/scripts/test-data/knowledge/e2e-qna-knowledge-mbta.yaml) + * [e2e-qna-knowledge-phoenix](https://github.com/instructlab/instructlab/blob/main/scripts/test-data/knowledge/e2e-qna-knowledge-phoenix.yaml) + * [e2e-qna-knowledge](https://github.com/instructlab/instructlab/blob/main/scripts/test-data/knowledge/e2e-qna-knowledge.yaml) ### Pregenerated Dataset (Input)