Fine grained SAM #9

Open
dlangenk opened this issue Aug 1, 2023 · 6 comments

@dlangenk
Member

dlangenk commented Aug 1, 2023

Big images and small objects don't mix well. SAM uses an input size of 1024x1024 pixels, so a 6000x8000 px image is downscaled by a factor of almost 8 and small objects shrink to just a few pixels, making them hard to detect. A way around this is to compute an embedding for only part of the image. However, I think this would call for computing the embedding on the client side on demand, or we need to think about an embedding computation service that is faster.
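For illustration, a rough, untested sketch of the cropped-embedding idea with the segment-anything package (the checkpoint path and crop size are placeholders):

```python
import numpy as np
from segment_anything import sam_model_registry, SamPredictor

# Assumption: a ViT-H checkpoint downloaded from the segment-anything repo.
sam = sam_model_registry["vit_h"](checkpoint="sam_vit_h_4b8939.pth")
predictor = SamPredictor(sam)

def crop_embedding(image: np.ndarray, x: int, y: int, size: int = 1024):
    """Embed a size x size crop centered on (x, y) instead of the full image."""
    h, w = image.shape[:2]
    # Clamp the crop window to the image bounds (falls back to the whole
    # image if it is smaller than the crop size).
    x0 = max(0, min(x - size // 2, w - size))
    y0 = max(0, min(y - size // 2, h - size))
    crop = image[y0:y0 + size, x0:x0 + size]
    predictor.set_image(crop)  # runs the heavy ViT image encoder
    return predictor.get_image_embedding(), (x0, y0)
```

The returned offset would be needed to translate point prompts from image coordinates into crop coordinates.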

dlangenk added the enhancement label Aug 1, 2023
@dlangenk
Member Author

dlangenk commented Aug 5, 2023

In the torchserve branch there is already an implementation for getting an embedding of a crop around a point (x, y).

@mzur
Member

mzur commented Aug 7, 2023

I'm very reluctant to introduce an architectural change (i.e. a torchserve service) for this. BIIGLE/Laravel is designed to use queued jobs for this kind of task, and these also scale much better (e.g. to multiple GPU machines). Cropped embeddings could also be implemented with a queued job and storage of the embedding file, but since the embedding can (probably) only be used a single time, that would be a waste of storage space. This needs more thinking.
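For the sake of argument, the queued-job variant would look roughly like this. Our real implementation would be a Laravel queued job, so this Python/RQ version is only meant to show the shape; it reuses the hypothetical crop_embedding helper from the sketch above:

```python
import numpy as np
from PIL import Image
from redis import Redis
from rq import Queue

def compute_and_store(image_path: str, x: int, y: int, out_path: str):
    """Worker-side job: compute the cropped embedding and persist it once."""
    image = np.array(Image.open(image_path).convert("RGB"))
    embedding, _ = crop_embedding(image, x, y)  # helper from the sketch above
    # This stored file is the wasted storage space if it is used only once.
    np.save(out_path, embedding.cpu().numpy())

# Note: RQ workers import the job function, so it must live in a module.
queue = Queue(connection=Redis())
job = queue.enqueue(compute_and_store, "image.jpg", 3000, 4000, "emb_123.npy")
```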

@dlangenk
Member Author

dlangenk commented Aug 7, 2023

The problem is that you usually do not want to wait more than about a second for a cropped embedding. This is only possible if

  • we precompute the embeddings -> a waste of storage, and there is a very large number of possible embeddings
  • or we have a service running all the time that already has the model loaded (whatever that service is: Flask server, torchserve, NVIDIA Triton, ...), as sketched below
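A rough, untested sketch of the second option; the endpoint and parameter names are made up, and it reuses the hypothetical crop_embedding helper from above, which keeps the model loaded in the process:

```python
import io
import numpy as np
from flask import Flask, request, send_file
from PIL import Image

app = Flask(__name__)
# crop_embedding (and the SamPredictor behind it) is created once at import
# time, so each request only pays for the encoder forward pass.

@app.post("/embedding")
def embedding():
    x, y = int(request.args["x"]), int(request.args["y"])
    image = np.array(Image.open(request.files["image"].stream).convert("RGB"))
    emb, _ = crop_embedding(image, x, y)
    buf = io.BytesIO()
    np.save(buf, emb.cpu().numpy())
    buf.seek(0)
    return send_file(buf, mimetype="application/octet-stream")
```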

Do we really need to scale to multiple GPU machines for an inference job that usually takes no more than a few seconds? Other people must have this issue too, so we can probably look for a solution there.

@mzur
Member

mzur commented Aug 7, 2023

I'm not against it per se. It just raises the complexity of the issue from "I might implement this if I have half a day of free time" to "I might think about it a little more if I have half a day of free time" 😉

I want to avoid a solution that is too specific. For example, now we have "slow" GPU workers and "fast" GPU workers. The slow ones are used by MAIA and the fast ones by SAM. These could also be used by any other module that needs GPU processing. A potential torchserve service should also be generic enough that it is not limited to SAM but could also run other stuff. Otherwise, we need one GPU for each new algorithm that we want to support.

mzur added the discuss label and removed the enhancement label Aug 7, 2023
@dlangenk
Member Author

dlangenk commented Aug 9, 2023

The fine-grained SAM could also make SAM available for Mosaics.

mzur added the MI3 label Oct 18, 2023
@mzur
Member

mzur commented Mar 20, 2024

While this would only be a workaround and no solution for tiled images, FeatUp could (maybe) improve the segmentation resolution without requiring many changes to the existing code.
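If FeatUp works the way its README suggests, it would slot in with only a few lines; the torch.hub entry point and backbone name here are from memory and would need to be verified:

```python
import torch

# Assumed hub path and backbone name from the FeatUp README; verify before use.
device = "cuda" if torch.cuda.is_available() else "cpu"
upsampler = torch.hub.load("mhamilton723/FeatUp", "dino16", use_norm=True).to(device)
image = torch.rand(1, 3, 224, 224, device=device)  # placeholder input batch
hr_feats = upsampler(image)        # upsampled high-resolution features
lr_feats = upsampler.model(image)  # original low-resolution backbone features
```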

mzur moved this to Medium Priority in BIIGLE Roadmap Jun 11, 2024
mzur mentioned this issue Jun 11, 2024
mzur removed the MI3 label Jan 29, 2025