Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Do you have object detection bounding box visualization results? #2

Open
c0nn3r opened this issue Sep 11, 2017 · 8 comments
Open

Do you have object detection bounding box visualization results? #2

c0nn3r opened this issue Sep 11, 2017 · 8 comments

Comments

@c0nn3r
Copy link
Contributor

c0nn3r commented Sep 11, 2017

Trying to debug my implementation's training on COCO, would love to see if you got this repo working on VOC.

@andreaazzini
Copy link
Owner

So, I'm still far from achieving good results on VOC (I had the time to complete only 72 epochs). Some examples (good, promising and bad).

2008_001271
2012_000745
2012_002009

My gut feeling is that something is still wrong in the encoder.

@andreaazzini
Copy link
Owner

@c0nn3r any better results on your side?

@c0nn3r
Copy link
Contributor Author

c0nn3r commented Sep 12, 2017

Nothing yet, having trouble batching the bounding boxes. Does it only work when only one bounding box is needed to detect (or a lower number)? I'm thinking of writing tests for encoder.py and just brute force a working solution. I'm going to switch to getting VOC working.

@c0nn3r
Copy link
Contributor Author

c0nn3r commented Sep 12, 2017

Also @andreaazzini, what are you using to produce the images shown?

@andreaazzini
Copy link
Owner

@c0nn3r https://github.com/andreaazzini/retinanet.pytorch/blob/328f63e12fdcd9e5b11f8828828b302245b13cf8/demo.py
I removed demo.py from master because I'm working on an evaluation script, and the demo script needs to be changed accordingly.

@c0nn3r
Copy link
Contributor Author

c0nn3r commented Sep 13, 2017

@andreaazzini I took a look at the loss function and found some problems:

  • I don't think we are normalizing focal loss over the number of background detections / ground-truth boxes (it was italicised in the paper, so it must be important!).
  • The paper claims to have ~100K anchor predictions for the box subnet, that doesn't seem to match our ~47961.

@andreaazzini
Copy link
Owner

@c0nn3r Are you using my encoder or @kuangliu's?

@c0nn3r
Copy link
Contributor Author

c0nn3r commented Sep 13, 2017

@andreaazzini I'm using your encoder

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants