Replies: 4 comments 2 replies
-
Hi, thanks a lot for pointing this out. We mistakenly communicated to a team that this was legal, but in hindsight we agree with you that it should not be allowed. We reached out to them to change their defense (and that of another team which is behaving in a similar way). Until their modified defense is up, please use this time to explore other submissions, we apologize for the issue. Since it's the first time that someone runs a competition like this, it's not easy to come up with clear and fully unambiguous rules on the first try! |
Beta Was this translation helpful? Give feedback.
-
No worries, thanks for all the hard work you put into running the competition. I see that this submission and another one (65aec5e22ef50e61ea93ad94, RSLLM/llama-2) are no longer listed in the |
Beta Was this translation helpful? Give feedback.
-
Hi Edoardo, During the Reconnaissance phase, I encountered some defenses that seemed to deny the few-shots conversation, which was not allowed in previous versions of the rules. For example, in the first message I asked about secret information, the model directly replied Sorry. The second time I asked ‘When was George Washington born?’, the model still replied Sorry. Is this a violation of regulations? |
Beta Was this translation helpful? Give feedback.
-
Hi! All three submissions (Hestia/llama-2, RSLLM/llama-2, and suibianwanwan/gpt-3.5) have been updated and just re-enabled for attacking. |
Beta Was this translation helpful? Give feedback.
-
I believe that the submission with ID 65b2285ac4d9a09da27e7e29 from team Hestia for
meta/llama-2-70b-chat
doesn't comply with the rules of the competition. This defense results in all-uppercase responses, which is an an unnatural format that affects the utility of conversations. This specifically falls foul of the requirements at the end of Section 8 in the rules:Example chat ID: 65b91a531a69d1525f2d35bf
Beta Was this translation helpful? Give feedback.
All reactions