Synth eval #24

Merged: 2 commits into main from synth-eval on Dec 20, 2024

Conversation

matthewcoole
Collaborator

Closes #23 by adding an LLM-as-a-judge evaluation step to the synthetic test set generation, which checks that synthetic questions:

  1. Are clear
  2. Reference the dataset they are based on, if they are not general questions
  3. Have an appropriate ground truth to evaluate against.

Synthetic questions that do not meet these criteria are removed from the evaluation set.
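
As a rough illustration only (not the exact code added in this PR), a judge step along these lines could look like the sketch below; the prompt wording, judge model, OpenAI client, and record fields (`question`, `dataset`, `ground_truth`) are assumptions for the example.

```python
# Hypothetical sketch of an LLM-as-a-judge filter for synthetic questions.
# Prompt wording, model choice, and record fields are assumptions, not this
# repository's implementation.
import json

from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

JUDGE_INSTRUCTIONS = (
    "You are reviewing a synthetic test question for a RAG evaluation set. "
    "Reply with a JSON object containing a boolean field 'keep' and a string "
    "field 'reason'. Keep the question only if: (1) it is clear, (2) it "
    "references the dataset it is based on unless it is a general question, "
    "and (3) its ground truth is an appropriate answer to evaluate against."
)


def judge_question(record: dict) -> bool:
    """Return True if the LLM judge says the synthetic question should be kept."""
    response = client.chat.completions.create(
        model="gpt-4o-mini",  # judge model is an assumption
        response_format={"type": "json_object"},
        messages=[
            {"role": "system", "content": JUDGE_INSTRUCTIONS},
            {
                "role": "user",
                "content": (
                    f"Question: {record['question']}\n"
                    f"Dataset: {record.get('dataset', 'N/A')}\n"
                    f"Ground truth: {record['ground_truth']}"
                ),
            },
        ],
    )
    verdict = json.loads(response.choices[0].message.content)
    return bool(verdict.get("keep", False))


def filter_synthetic_set(records: list[dict]) -> list[dict]:
    """Drop synthetic questions that fail any of the judge's criteria."""
    return [r for r in records if judge_question(r)]
```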

@matthewcoole matthewcoole merged commit 8016933 into main Dec 20, 2024
1 check passed
@matthewcoole matthewcoole deleted the synth-eval branch December 20, 2024 08:47

answer_correctness: 0.5072450137294304
answer_relevancy: 0.5050751636311221
context_recall: 0.5142385736312046
context_precision: 0.4634856011771453
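
For reference, metric names like these match the ragas library, so scores of this kind would typically come from a `ragas.evaluate` run; a rough sketch follows, assuming a Hugging Face `Dataset` with the usual ragas columns (exact column names and API details may differ by ragas version, and the placeholder rows are illustrative only).

```python
# Hypothetical example of producing the four scores above with the ragas
# library; dataset columns and API details are assumptions, and an LLM API
# key is assumed to be configured for ragas' judge/embedding calls.
from datasets import Dataset
from ragas import evaluate
from ragas.metrics import (
    answer_correctness,
    answer_relevancy,
    context_precision,
    context_recall,
)

# Each row pairs a (filtered) synthetic question with the pipeline's answer,
# the retrieved contexts, and the ground truth it was generated with.
eval_data = Dataset.from_dict(
    {
        "question": ["<synthetic question>"],
        "answer": ["<pipeline answer>"],
        "contexts": [["<retrieved context>"]],
        "ground_truth": ["<ground truth answer>"],
    }
)

result = evaluate(
    eval_data,
    metrics=[answer_correctness, answer_relevancy, context_recall, context_precision],
)
print(result)
```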

Development

Successfully merging this pull request may close these issues.

Poor questions in synthetic test set