Skip to content

Latest commit

 

History

History
40 lines (32 loc) · 2.47 KB

README.md

File metadata and controls

40 lines (32 loc) · 2.47 KB

RetinaNet-mxnet

Adapted from SSD implemented by zhreshold, the results still need to be tuned. Currently we use the PASCAL VOC mAP metric which measures under IoU threshold 0.5, not the COCO AP metric.

Demo Results

image image image

Differences from SSD

  • We build FPN (P3 to P7) to replace the "multi_layer_feature" function;
  • We build cls_subnet and bbox_subnet in the "multibox_layer" function, and the bias is initialized according to the Focal Loss paper (only for FL strategy);
  • We use the anchor setting in the focal loss paper which is tested on COCO, but the best setting for PASCAL VOC still needs to be tuned;
  • We adopt the focal loss operator by eldercrow;
  • We support converting COCO2017 data to rec format for training and validation.

Usage

  • Download COCO2017 data and annotations;
  • Run tools/prepare_coco.sh to pack into rec format, after configuring your own paths;
  • Run train-COCO2017.sh after configuring your own paths and hyperparamters.

For PASCAL VOC and more details, one can generally refer to SSD implemented by zhreshold.

Environment

Tested on Ubuntu 16.04, python3.5, mxnet 1.1.0

Numpy, cv2 and matplotlib are required.

mAP result

Backbone Training data Val data Strategy mAP Note
ResNet-50 512x512 VOC07+12 trainval VOC07 test OHEM 76.0 sgd, lr0.01
ResNet-50 512x512 VOC07+12 trainval VOC07 test FL 75.4 sgd, lr0.01
ResNet-50 512x512 COCO2017 train COCO2017 val OHEM 40.2 sgd, lr0.01
ResNet-50 512x512 COCO2017 train COCO2017 val FL 40.9 sgd, lr0.01

Baseline Faster RCNN

Backbone Training data Val data mAP Note
ResNet-50 600 VOC07+12 trainval VOC07 test 74.8 sgd, lr0.001
ResNet-50 600 COCO2017 train COCO2017 val 37.9 sgd, lr0.003