unlabeled data how will effect on my model #12242
-
Hello, So, if I add some random images from imagenet (without labels) to a face-detecting Yolo model, will I strengthen my model? |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 1 reply
-
The Ultralytics YOLOv5 documentation mentions the use of "background images" specifically to reduce false positives (FPs) during object detection. These are images that contain no instances of the objects you're interested in detecting and are included in your training set without any labels. According to the guide, they recommend using about 0-10% background images relative to your dataset size to help reduce FPs. In this context, adding unlabeled, unrelated images would act as these "background images." The key purpose is to teach the model what doesn't constitute the object of interest, thereby reducing the likelihood of false positives. This is directly in line with what you're asking; adding such images should improve the model's ability to discern the object you're interested in detecting from other unrelated objects or backgrounds. However, the addition should be done carefully to maintain the balance in your dataset and not to introduce any class imbalance or skewed learning. So, according to this specific guide, adding random images from ImageNet without labels (effectively treating them as background images) is a recommended practice to improve your model's performance by reducing false positives. |
Beta Was this translation helpful? Give feedback.
The Ultralytics YOLOv5 documentation mentions the use of "background images" specifically to reduce false positives (FPs) during object detection. These are images that contain no instances of the objects you're interested in detecting and are included in your training set without any labels. According to the guide, they recommend using about 0-10% background images relative to your dataset size to help reduce FPs.
In this context, adding unlabeled, unrelated images would act as these "background images." The key purpose is to teach the model what doesn't constitute the object of interest, thereby reducing the likelihood of false positives. This is directly in line with what you're asking…