From 3457b7205baa763054b90f2ac413c50f548f891a Mon Sep 17 00:00:00 2001 From: "Ziru \"Ron\" Chen" Date: Thu, 3 Oct 2024 15:10:27 -0400 Subject: [PATCH] v1 finale --- index.html | 7 +------ 1 file changed, 1 insertion(+), 6 deletions(-) diff --git a/index.html b/index.html index 7dff8a2..bbe08d2 100644 --- a/index.html +++ b/index.html @@ -1116,12 +1116,7 @@

Tasks in ScienceAgentBench

-

LogoTravelPlanner - constraint description. The environment constraint is manifested through the feedback received from the - environment, assessing whether the language agent can adjust its plan appropriately. The commonsense - constraint and hard constraint are evaluated based on how well the language agent's plan aligns with - these specific criteria. -

+

Example tasks in ScienceAgentBench.