Could you release small/tiny/nano version of detector and descriptor? #30
Comments
Sure. The easiest approach, I guess, would be to use VGG11 and reduce the number of layers further. Should be doable. Not sure how much performance will degrade.
About 30 fps on an RTX 4060 Mobile 8G.
@zhongqiu1245 Could you try out the small detector in the branch that references this issue? Weights can be found here: https://github.com/Parskatt/DeDoDe/releases/tag/v2
It uses a VGG11 backbone; I reduced the number of layers at each scale from 8 to 4 and cut the dimensionality in half. I think it should be about 3-4x faster than the _L detector. Could you verify?
Depending on your application, it might also be possible to increase the framerate by batching. Is this an option for you?
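To illustrate the batching idea, here is a minimal sketch of grouping frames so that one forward pass handles several at once. `run_detector` is a hypothetical stand-in, not the DeDoDe API; in a real pipeline the batch would presumably be stacked into a single tensor before being passed to the network.

```python
from itertools import islice


def batched(frames, batch_size):
    """Yield lists of up to `batch_size` consecutive frames."""
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    it = iter(frames)
    while True:
        batch = list(islice(it, batch_size))
        if not batch:
            return
        yield batch


def run_detector(batch):
    # Hypothetical stand-in for one forward pass over a whole batch.
    return [f"keypoints({frame})" for frame in batch]


frame_ids = range(10)  # placeholder for decoded video frames
results = []
calls = 0
for batch in batched(frame_ids, 4):
    results.extend(run_detector(batch))
    calls += 1

print(calls, len(results))  # 3 forward passes for 10 frames
```

Batching tends to raise throughput rather than lower per-frame latency, since a live stream has to wait until a batch is full, so whether it helps depends on the application.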
@Parskatt
@Parskatt So I reduced the image size to 320x240, and then got 25 fps. Almost there.
Sure, then I think we can also reduce the descriptor size. Does 128 sound better? Is descriptor dimensionality a concern?
Thank you for your reply!
Some details:
Okay, so it seems like around 20 fps is at least possible with the current sizes. Are you able to extract the times for the encoder/decoder parts of the network? Depending on what takes the most time, I might need to change the encoder architecture. The final thing, I guess, would be to distill both networks into a single network.
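One way to get the per-stage times could look like the sketch below. `encoder` and `decoder` are hypothetical stand-ins for the two parts of the network, and `time_stage` is an illustrative helper, not part of DeDoDe. On a GPU the work is asynchronous, so a synchronization call (for example `torch.cuda.synchronize` in PyTorch) would need to be passed as `sync` for the numbers to mean anything.

```python
import time
from statistics import median


def time_stage(fn, x, warmup=3, runs=10, sync=lambda: None):
    """Return (output, median seconds per call) for fn(x).

    `sync` is called before reading the clock; the default is a no-op,
    which is only adequate for CPU-bound stand-ins like the ones below.
    """
    for _ in range(warmup):
        fn(x)
    sync()
    samples = []
    for _ in range(runs):
        start = time.perf_counter()
        out = fn(x)
        sync()
        samples.append(time.perf_counter() - start)
    return out, median(samples)


# Hypothetical stand-ins for the encoder and decoder halves of the network.
def encoder(image):
    return [sum(row) for row in image]


def decoder(features):
    return [f * 0.5 for f in features]


image = [[1.0] * 640 for _ in range(480)]  # dummy 640x480 input
features, t_enc = time_stage(encoder, image)
scores, t_dec = time_stage(decoder, features)

total = t_enc + t_dec
print(f"encoder: {t_enc * 1e3:.3f} ms ({t_enc / total:.0%})")
print(f"decoder: {t_dec * 1e3:.3f} ms ({t_dec / total:.0%})")
```

The warm-up runs are there because the first few calls often include one-off costs (allocation, kernel selection) that would otherwise skew the comparison.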
OK, I will try later.
Hello, thank you for your amazing work!
I'm really interested in your work and want to deploy DeDoDe on mobile devices (laptop, even CPU) for some self-driving work.
But I find that DeDoDeDescriptorB and DeDoDeDetectorL are too heavy to run on a mobile device.
On my computer (RTX 4060 Mobile 8G), I get only 5.4 fps with 640x480 inputs (tensorrt_fp16).
Could you release a small/tiny/nano version of the detector and descriptor?
Thank you in advance!
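For a sense of scale, the arithmetic below converts the measured frame rate into a per-frame time budget. The 5.4 fps figure is from the post; the 30 fps target is taken from the figure mentioned in the comments and is treated here as an assumption about the goal.

```python
def frame_time_ms(fps):
    """Time available per frame, in milliseconds, at a given frame rate."""
    return 1000.0 / fps


measured_fps = 5.4  # reported above: 640x480 input, tensorrt_fp16
target_fps = 30.0   # assumption: the frame rate mentioned in the comments

measured_ms = frame_time_ms(measured_fps)
target_ms = frame_time_ms(target_fps)
speedup = target_fps / measured_fps

print(f"measured: {measured_ms:.1f} ms per frame")  # 185.2 ms
print(f"target:   {target_ms:.1f} ms per frame")    # 33.3 ms
print(f"required speedup: {speedup:.1f}x")          # 5.6x
```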