
Assertion Error #26

Open
Masrur02 opened this issue Aug 20, 2024 · 6 comments

Comments

@Masrur02

I have modified the Python file grounded_sam2_local_demo.py to predict from a video file. I found that grounding_dino/grounddino/utils/inference.py has this function:

def load_image(image_path: str) -> Tuple[np.array, torch.Tensor]:
    transform = T.Compose(
        [
            T.RandomResize([800], max_size=1333),
            T.ToTensor(),
            T.Normalize([0.485, 0.456, 0.406], [0.229, 0.224, 0.225]),
        ]
    )
    image_source = Image.open(image_path).convert("RGB")
    image = np.asarray(image_source)
    image_transformed, _ = transform(image_source, None)
    return image, image_transformed
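For context, a minimal sketch of the shortest-side resize rule that `T.RandomResize([800], max_size=1333)` follows. This mirrors the common DETR/torchvision-style behavior (scale the shorter side to `size`, then shrink the scale if the longer side would exceed `max_size`); `get_resized_hw` is a hypothetical helper, not a function from the repo, and the exact rounding in the real transform may differ slightly:

```python
# Sketch, assuming DETR-style resize semantics: the shorter side is scaled
# to `size` unless that would push the longer side past `max_size`, in
# which case the longer side is capped at `max_size` instead.
def get_resized_hw(h: int, w: int, size: int = 800, max_size: int = 1333) -> tuple:
    short, long = min(h, w), max(h, w)
    scale = size / short
    if long * scale > max_size:
        scale = max_size / long  # cap the longer side instead
    return round(h * scale), round(w * scale)

# A 1080x1920 frame: scaling the short side to 800 would make the long
# side ~1422 > 1333, so the long side is capped at 1333 instead.
print(get_resized_hw(1080, 1920))  # -> (750, 1333)
```

This is why the transform "maintains the aspect ratio": both dimensions are multiplied by the same scale factor, whichever of the two constraints ends up binding.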

Here, the shorter side of the image is resized to 800 and the longer side is capped at 1333. However, when I change the 800 to 400 and max_size to 600 (which still maintains the aspect ratio), I get an error like this:

[screenshot of the AssertionError traceback]
I reduced the image size to get a higher FPS. How can I solve this issue? Also, is there any other way to increase the FPS?

TIA

@rentainhe
Collaborator

rentainhe commented Aug 21, 2024

Dear @Masrur02

I think this issue is very similar to #10.

Could you check the grounding results to see whether any grounding output is actually produced?

@ZhangT-tech

It works when you add a '.' after your object, so instead of TEXT_PROMPT="bird", it should be "bird."

@SJP2022

SJP2022 commented Sep 25, 2024

> It works when you add a '.' after your object, so instead of TEXT_PROMPT="bird", it should be "bird."

@ZhangT-tech @rentainhe
The comment # VERY important: text queries need to be lowercased + end with a dot is in the code.
The dot successfully solves this problem, but I wonder why; do you have any insights? Thanks a lot!

@ZhangT-tech

ZhangT-tech commented Sep 26, 2024

The actual reason for the assertion error is that the model didn't detect anything in the video/image. The original repo code doesn't differentiate this case, so you can manually add an if statement to skip such a video:

if input_boxes.size == 0:
    print("No objects detected, skipping this video.")
    print(f"The video is from {SOURCE_VIDEO_FRAME_DIR}")
    return True
print(input_boxes)

As for the '.', it is just a formatting convention implicit in their code, like an end-of-sequence token, used to separate the objects you want to segment. For example, in their grounded_sam2_gd1.5_demo.py, they use '.' to separate two objects.
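The separator behavior described above can be sketched as follows; `normalize_prompt` is a hypothetical helper (not part of the repo) that applies the two rules from the code comment, lowercasing each query and terminating it with a dot:

```python
# Sketch, assuming GroundingDINO-style prompts: '.' separates phrases,
# and each phrase must be lowercased and end with a dot.
def normalize_prompt(*objects: str) -> str:
    # lowercase each phrase, strip any existing trailing dot, re-append one
    return " ".join(obj.strip().lower().rstrip(".") + "." for obj in objects)

print(normalize_prompt("Bird", "Car"))  # -> "bird. car."
```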

@SJP2022

SJP2022 commented Sep 26, 2024

I get it, thank you for your explanation!

@garychan22

I have encountered this issue when directly running the sample code grounded_sam2_local_demo.py. How can I resolve it?
