mAP is too low but detect objects well with trained ckpt #27

Open · Janezzliu opened this issue Apr 15, 2018 · 10 comments

@Janezzliu

Hi @LevinJ, I applied SSD_tensorflow_VOC to my own dataset. I first trained the SSD-specific weights with self.max_number_of_steps = 10000, then trained the VGG16 and SSD-specific weights with self.max_number_of_steps = 900000. The first step has finished and the second step has reached 60000. My loss is around 1.8, training mAP is 0.18 and testing mAP is 0.17. However, when I use the trained ckpt to detect objects in the test pictures, it does well! So I went through your code and the website https://sanchom.wordpress.com/tag/average-precision/ to learn how mAP is computed. I didn't find anything wrong. I'm quite confused. The test results with the trained ckpt don't match the mAP of 0.17.

@Janezzliu
Author

Janezzliu commented Apr 15, 2018

I have found the reason. My dataset has 4 labels, so I need to construct a corresponding dictionary variable that stores the AP of every label in order to compute mAP.
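
Roughly, the fix amounts to averaging the per-label APs. A minimal sketch of that step (the label ids and AP values below are made up purely for illustration):

import numpy as np

# Hypothetical per-label AP values for a 4-label dataset; in practice these
# come from the evaluation script's per-class AP computation.
average_precision = {1: 0.72, 2: 0.65, 3: 0.81, 4: 0.58}

# mAP is just the mean of the per-label APs. If the dictionary is missing
# entries for some labels, the reported mAP gets dragged down even though
# detection itself looks fine.
mAP = np.mean(list(average_precision.values()))
print('mAP = %.3f' % mAP)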

@LevinJ
Owner

LevinJ commented Apr 16, 2018

Hi @Janezzliu, glad to learn you've got the issue fixed. Nice debugging!

@ghost

ghost commented Apr 19, 2018

@LevinJ, for all the poor souls out there still trying to figure out why there is such a big gap in evaluation:

  • From the previous repo, the issue was in the generation of tfrecords. The tfrecords have the difficults clamped to 0, so the ground truths are wrong. All ground truths are labeled as non-difficult when they shouldn't be, since difficult boxes are meant to be sorted out in the bboxes_matching method. The evaluation therefore eventually evaluates on difficults as well, which gives this big gap. The original Caffe implementation achieves 0.69 mAP on evaluation with difficult ground truths.
  • Note that you should still train on difficults though; they shouldn't be excluded from the training.

I am leaving this comment here since it's the most recent one regarding mAP. I haven't tested your code, but I see that the script there is the same, so I am going to assume that the error persists. I hope this helps.

@LevinJ
Owner

LevinJ commented Apr 20, 2018

Hi @bnbhehe , thanks for sharing your findings!

Can you elaborate a bit on "The tfrecords have the difficults clamped to 0, so the ground truths are wrong. All ground truths are labeled as non-difficult when they shouldn't be, since difficult boxes are meant to be sorted out in the bboxes_matching method."?

Which lines of code clamp the difficult attribute of training/evaluation samples to 0? I tried checking the code, but was not able to find them.

@ghost

ghost commented Apr 21, 2018

In the pascal_to_tfrecords script there is this line:

if obj.find('difficult'):

The thing is that find returns NoneType, therefore this check is always false. I decoded the tfrecords and there were no difficults; everything had a value of 0, therefore you always evaluate on them. For training though you do need them, which is why that filtering is not called in the train script.

You can notice this data corruption if you try to evaluate with difficults by switching the remove_difficult flag to false; the mAP should stay the same. (Writing from mobile, so sorry if my reply is vague.)

@LevinJ
Owner

LevinJ commented Apr 24, 2018

Hi @bnbhehe, not sure if it's an environment setup related issue, but it looks like obj.find('difficult') can return a valid object on my desktop, as you can see below.

[screenshot from 2018-04-24 07-58-13]

I checked the annotation file, and there is indeed a difficult field:

<object>
    <name>dog</name>
    <pose>Left</pose>
    <truncated>1</truncated>
    <difficult>0</difficult>
    <bndbox>
        <xmin>48</xmin>
        <ymin>240</ymin>
        <xmax>195</xmax>
        <ymax>371</ymax>
    </bndbox>
</object>

What are your thoughts on this?

@ghost

ghost commented Apr 24, 2018

It should return something, but it's NoneType and the check will fail. Only if you call .text does it give a value, but if the label is not present it will crash. I used Python 3.5 and didn't get the behavior I wanted with this XML parser, so I rewrote process_image in the tfrecords script with xmltodict.

I would suggest you check the remove_difficult flag in the eval script. If the data is corrupted, turning it off will not affect the mAP. (Spoiler alert: for me it didn't.)

I would also make a small decoding script to see how many difficult annotations are actually present, something like the sketch below.
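
Something along these lines should do it (the tfrecord filename and the feature key 'image/object/bbox/difficult' are assumptions here; adjust them to whatever your conversion script actually writes):

import tensorflow as tf

def count_difficults(tfrecord_path):
    # Count total boxes and boxes flagged as difficult in one tfrecord file.
    total, difficult = 0, 0
    for record in tf.python_io.tf_record_iterator(tfrecord_path):
        example = tf.train.Example()
        example.ParseFromString(record)
        flags = example.features.feature['image/object/bbox/difficult'].int64_list.value
        total += len(flags)
        difficult += sum(flags)
    return total, difficult

# With the clamping bug, the difficult count comes out as 0 for every file.
print(count_difficults('voc_2007_train_000.tfrecord'))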

Please reply with the best mAP you get once this is fixed.

@LevinJ
Owner

LevinJ commented Apr 24, 2018

Hi @bnbhehe, I checked the code a bit more closely and agree that you are correct: currently all bounding boxes are mistakenly labelled as non-difficult.

This is because obj.find('difficult') returns an Element object whose len is zero, and an Element with no children evaluates as falsy; as a result, if obj.find('difficult'): always evaluates to False. For those who are interested, see here for more details. To fix the bug, one could simply replace the line with if obj.find('difficult') is not None:
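
A minimal sketch of the pitfall and the fix (the annotation snippet is trimmed to just the relevant field, and marked difficult so the difference is visible):

import xml.etree.ElementTree as ET

obj = ET.fromstring("""
<object>
    <name>dog</name>
    <difficult>1</difficult>
</object>
""")

elem = obj.find('difficult')
print(elem is None)   # False: the element was found
print(bool(elem))     # False: an Element with no children is falsy

# Buggy check: the branch is never taken, so the box is recorded as non-difficult.
difficult = int(obj.find('difficult').text) if obj.find('difficult') else 0
print(difficult)      # 0

# Fixed check: test against None explicitly.
difficult = int(obj.find('difficult').text) if obj.find('difficult') is not None else 0
print(difficult)      # 1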

As for evaluating the model on the non-difficult ground truth labels, I am currently quite tied up with other stuff, and might do it when I have some time :)

By the way, can you tell me where you found that "The original Caffe implementation achieves 0.69 mAP on evaluation with difficult ground truths."? Thanks.

@ghost

ghost commented Apr 24, 2018

@LevinJ, I trained it myself on Caffe, twice actually. I reproduced the results of the paper and got 70% and 69% on evaluation with difficults. If you have the original implementation, it's basically line 415 in this script; you switch that to True.

@Jasonsun1993

@Janezzliu I got the same problem. I was wondering in which file to construct the dictionary.
