Replies: 4 comments 6 replies
-
"object size is very small" doesn't give us enough info. What is your network dimensions, and how small are the objects when the images are resized to the network dimensions? To understand: https://www.ccoderun.ca/darkhelp/api/Tiling.html If you have 100 classes, you should train for about 200,000 iterations. You shouldn't be looking at results after only 4900 iterations. Also see https://www.ccoderun.ca/programming/darknet_faq/#how_many_images |
Beta Was this translation helpful? Give feedback.
-
You need to re-read those pages. I wasn't asking you to try tiling, I needed you to understand the implications of your choices when it comes to sizing. Eg, take your 6025 x 4015 image, resize it to 608x608 and let us know the size of your objects once resized. |
Beta Was this translation helpful? Give feedback.
-
Regardless, even ignoring the other problems in the previous comment, the minimum for max batches is 6000, so you still would have to train some more. |
Beta Was this translation helpful? Give feedback.
-
It does work. It works very well! See my youtube channel for examples.
You need to re-read my page. After resizing, if the objects are smaller than 16x16, you'll have problems. If smaller than 10x10, it cannot be detected. Since your objects are as small as 3x7, I can guarantee you it wont work without using something like DarkHelp or your own implementation of tiling. |
Beta Was this translation helpful? Give feedback.
-
@AlexeyAB I am trying to train YOLO v4 to detect OCR from images, and the object size is very small when compared to the actual image size (I used all the recommendations provided in https://github.com/AlexeyAB/darknet#how-to-improve-object-detection section ). I have to train the network to detect about 100 classes but I want to make sure that the detection works, so this time I used YOLO v4 with yolov4_new.cfg for 3 classes. I changed the
anchors=
,filters=24
andclasses=3
for each YOLO layer and trained the network with the custom train and validation data-sets on Google Colab. According to the avg loss and map plot the training went well and the avg loss was 0.1395 at 4900 iterations.map
on all the weight files and I am not getting any map values.Also, I used pre-trained convolutional layers to start the training (yolov4.conv.137)
I am at a complete loss as to why I am not able to detect any of the objects, even though the training went well (I think)
Beta Was this translation helpful? Give feedback.
All reactions