Highlights
- Pro
Popular repositories Loading
-
-
manipulation-chatarena
manipulation-chatarena PublicForked from Farama-Foundation/chatarena
Fork of chatarena: add examples that help to study the manipulation capabilities of LLMs
Python 2
-
evals_manip
evals_manip PublicForked from openai/evals
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
Python 1
-
-
tasks-goose
tasks-goose PublicForked from magikarp01/tasks
My own set of general evaluations to be shared between projects
Jupyter Notebook 1
If the problem persists, check the GitHub status page or contact support.