
Question on the Loss function #3

Open
simo23 opened this issue Sep 5, 2017 · 2 comments

Comments


simo23 commented Sep 5, 2017

Hi, thanks for your very useful code!

I'd like to ask about the loss function, because I've seen it implemented in many ways and this seems like the best one:

  • Is it correct to compute a single mask for an entire batch and compute the loss on it? I believe this would be wrong, because across a batch many objects could fall in the same cell, so the masks would overlap. I'm quite sure this is not how it is implemented here.

  • Is it correct to compute the mask for each single image and then compute the loss as the average over the batch, like:

    total_loss = 1/N sum_for_every_image(image_loss)
    

This is how I think it is implemented here, but can you confirm? Unfortunately it is not clear to me exactly how it is computed.
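For concreteness, here is a minimal numpy sketch of the per-image averaging I mean (the per-image loss body is just a placeholder, not the repo's actual implementation, and the shapes are made up):

```python
import numpy as np

def image_loss(y_true, y_pred):
    # Placeholder per-image loss: sum of squared errors over this image's grid.
    # The real YOLO v2 loss combines coordinate, confidence, and class terms,
    # each with its own per-image mask.
    return np.sum(np.square(y_true - y_pred))

def total_loss(batch_true, batch_pred):
    # Build the mask/loss separately per image, then average over the batch:
    # total_loss = 1/N * sum_for_every_image(image_loss)
    per_image = [image_loss(t, p) for t, p in zip(batch_true, batch_pred)]
    return np.mean(per_image)

batch_true = np.zeros((4, 13, 13, 5))  # N=4 images, 13x13 grid, 5 values each
batch_pred = np.ones((4, 13, 13, 5))
print(total_loss(batch_true, batch_pred))  # 845.0 = 13*13*5 per image
```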

Also, can you explain the following lines from "yolo_v2_confidence_loss"?

  object_no_detections = tf.cast(iou < 0.6, tf.float32) 
  objects_loss = object_scale * object_mask * object_no_detections * tf.square(1 - box_confidence)

I imagine you are thresholding the IoU between the ground-truth boxes and the predictions, but can you explain this choice? Are you saying that even if a box is in the right cell, it must have IoU > 0.6 w.r.t. the ground truth to be counted as a detection in this loss?
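To make sure I understand those two lines, here is a small numpy sketch of what I think they do (the shapes and toy values are my own, not the repo's):

```python
import numpy as np

# Toy grid of 4 predictors: best IoU of each predicted box against any GT box.
iou = np.array([0.1, 0.5, 0.7, 0.9])
box_confidence = np.array([0.2, 0.5, 0.8, 0.95])  # predicted objectness
object_mask = np.array([0.0, 1.0, 0.0, 1.0])      # 1 where a GT is assigned
object_scale = 5.0

# Predictors whose best IoU is below 0.6 count as "no detection";
# those at or above 0.6 are excused from this penalty.
object_no_detections = (iou < 0.6).astype(np.float32)

# Only assigned predictors (mask == 1) that still fail the 0.6 IoU test
# are pushed toward confidence 1.
objects_loss = object_scale * object_mask * object_no_detections * np.square(1 - box_confidence)
# objects_loss -> [0., 1.25, 0., 0.]  (only predictor 1 is penalized)
```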

Lastly, are you planning to add the possibility to load pre-trained weights? It would be useful, and if you want I could also help.

Thanks, Andrea

@PaulChongPeng
Owner

  1. total_loss = 1/N sum_for_every_image(image_loss)
  2. If the IoU between the prediction box and the ground-truth box is bigger than 0.6, I consider it a correct cell. Otherwise, it contributes to the object loss.


simo23 commented Nov 7, 2017

Hi,
thanks for your answer. I have another question, though.

By accepting any box with IoU > 0.6 against a ground-truth box as correct, aren't you basically allowing detectors (of any cell, and any anchor) to go after objects they are not necessarily responsible for? Am I right?

Wouldn't it be more correct to find the cell in which the ground-truth box has its center, then select in that cell the anchor with the best IoU against the ground-truth box, and keep a single 1 in the loss mask (in the right cell, at the right anchor) for each ground-truth object?
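A rough numpy sketch of the assignment I'm describing (cell from the GT center, then the best-IoU anchor in that cell; the anchor sizes here are made up, not the repo's):

```python
import numpy as np

def iou_wh(wh1, wh2):
    # IoU of two boxes compared by width/height only (centers aligned),
    # the usual YOLO v2 anchor-matching measure.
    inter = min(wh1[0], wh2[0]) * min(wh1[1], wh2[1])
    union = wh1[0] * wh1[1] + wh2[0] * wh2[1] - inter
    return inter / union

def assign_responsible(gt_box, anchors, grid_size=13):
    # gt_box = (cx, cy, w, h), all normalized to [0, 1].
    cx, cy, w, h = gt_box
    col = int(cx * grid_size)      # cell containing the GT center
    row = int(cy * grid_size)
    ious = [iou_wh((w, h), a) for a in anchors]
    best = int(np.argmax(ious))    # single best-matching anchor in that cell
    return row, col, best

anchors = [(0.05, 0.08), (0.2, 0.3), (0.6, 0.7)]  # made-up anchor sizes
mask = np.zeros((13, 13, len(anchors)))
r, c, a = assign_responsible((0.5, 0.5, 0.22, 0.28), anchors)
mask[r, c, a] = 1.0  # exactly one 1 in the mask per ground-truth object
```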

Thanks,
Andrea
