Question about object detection #75

haoyi-duan · 2024-10-22T17:22:53Z

Hi, nice work! I have a question about your 'object class' dimension. My understanding is that Grit can predict some detected objects and their corresponding captions, which might not exactly be the same in the original prompt. For example, the original prompt could be 'a cat', the captions of the detected objects could end up being 'an orange cat', 'grass', 'bench', .... How do I check the object ('a cat') is successfully detected ('an orange cat')? Is it using LLM or clip similarity?

ziqihuangg assigned yinanhe Dec 29, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question about object detection #75

Question about object detection #75

haoyi-duan commented Oct 22, 2024

Question about object detection #75

Question about object detection #75

Comments

haoyi-duan commented Oct 22, 2024