Add automatic evaluation with LLM-as-a-Judge, LangSmith export, and SGI evaluation #989
Job | Run time |
---|---|
8s | |
1m 29s | |
1m 18s | |
1m 50s | |
1m 58s | |
3m 42s | |
2m 5s | |
2m 4s | |
3m 22s | |
17m 56s |
Job | Run time |
---|---|
8s | |
1m 29s | |
1m 18s | |
1m 50s | |
1m 58s | |
3m 42s | |
2m 5s | |
2m 4s | |
3m 22s | |
17m 56s |