About white box attack #15

Open
payphone131 opened this issue Dec 25, 2024 · 2 comments

Comments

@payphone131

Hi, I found your work really interesting and tried to reproduce some results. I noticed a "flagged" field referenced at line 42 of generate_init_image.py, but I have not generated any JSON file containing a "flagged" field. Could you tell me how to resolve this, or am I missing a step that produces such a JSON file?

@Vincent-HKUSTGZ

I also ran into this problem. How can it be solved?

@pyogher
Collaborator

pyogher commented Jan 8, 2025

@payphone131 @Vincent-HKUSTGZ Hi, in our white-box attack setting, we use the images that successfully jailbreak MLLMs with the optimized black-box images as the initial images. The process_data function in generate_init_image.py therefore filters out the cases that failed in the black-box attack.
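
For anyone hitting the same question, a minimal sketch of what such a filtering step could look like is below. It assumes the black-box stage produces a JSON list of records where a "flagged" field marks a successful jailbreak; the file paths, record layout, and the exact meaning of "flagged" are assumptions for illustration, not the repository's actual schema.

```python
import json

def process_data(results_path: str, output_path: str) -> list:
    """Keep only the black-box cases whose jailbreak succeeded, so their
    images can serve as initial images for the white-box attack.

    Hypothetical sketch: the real generate_init_image.py may use a
    different record schema or success criterion.
    """
    with open(results_path, "r") as f:
        records = json.load(f)  # assumed: a list of dicts, each with a "flagged" field

    # Assumed semantics: "flagged" == True means the judge flagged the MLLM
    # response as harmful, i.e. the black-box jailbreak succeeded.
    successful = [r for r in records if r.get("flagged", False)]

    with open(output_path, "w") as f:
        json.dump(successful, f, indent=2)

    return successful
```

The key point is that the JSON with the "flagged" field is expected to come out of the black-box attack evaluation step, so that stage needs to be run first before generating the initial images.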
