Use the same train-val-test split #4

hermancollin · 2024-09-06T14:51:42Z

In the preprocessing file, the data is first converted to YOLO format, then to COCO. The problem is that the data is shuffled at both steps, so the two train-val-test splits are random and do not match. The splits should be identical; otherwise we cannot fairly compare the two methods.

There should be a JSON or YAML file with a hardcoded data split. Both preprocess_data_coco() and preprocess_data_yolo() should read it so that they split the data identically.
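As a rough sketch of how such a file could be produced once and then committed: the helper names `make_split` and `write_split`, the 10% fractions, and the seed are assumptions for illustration, not anything that exists in this repo. The filenames are sorted before shuffling so the result does not depend on directory listing order, and a local `random.Random` instance keeps the global RNG untouched.

```python
import json
import random
from pathlib import Path


def make_split(filenames, val_frac=0.1, test_frac=0.1, seed=42):
    """Partition filenames into train/val/test deterministically.

    Hypothetical helper: the fractions and seed are placeholder values.
    """
    names = sorted(filenames)            # make the result independent of input order
    random.Random(seed).shuffle(names)   # local RNG with a fixed seed
    n_val = int(len(names) * val_frac)
    n_test = int(len(names) * test_frac)
    return {
        "train": names[n_val + n_test:],
        "val": names[:n_val],
        "test": names[n_val:n_val + n_test],
    }


def write_split(split, path):
    """Write the split dictionary to a JSON file that can be committed."""
    Path(path).write_text(json.dumps(split, indent=2))
```

Once the file is generated and checked in, neither preprocessing function would need to shuffle at all; they would only look up which subset each image belongs to.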

hermancollin · 2024-09-16T18:58:28Z

We should hardcode the data split in a JSON file inside this repo. It should look like this:

{
   "train": [
      "sub-rat1_sample-[...].png",
      "sub-rat2_sample-[...].png",
      [...]
   ],
   "val": [
      [...]
   ],
   "test": [
      [...]
   ]
}
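A minimal sketch of how both preprocessing functions could consume a file with this layout. The names `load_split` and `select` are hypothetical, and how preprocess_data_coco() and preprocess_data_yolo() would actually call them is an assumption, since their signatures are not shown here. The loader also checks that no filename appears in more than one subset.

```python
import json

SUBSETS = ("train", "val", "test")


def load_split(path):
    """Load the hardcoded split and check that the subsets are disjoint."""
    with open(path) as f:
        split = json.load(f)
    missing = set(SUBSETS) - set(split)
    if missing:
        raise ValueError(f"split file is missing keys: {sorted(missing)}")
    seen = set()
    for subset in SUBSETS:
        overlap = seen & set(split[subset])
        if overlap:
            raise ValueError(f"files listed in more than one subset: {sorted(overlap)}")
        seen |= set(split[subset])
    return split


def select(filenames, split, subset):
    """Return the files that belong to the given subset, keeping input order."""
    wanted = set(split[subset])
    return [name for name in filenames if name in wanted]
```

Because both converters would read the same file instead of shuffling, the YOLO and COCO datasets end up with the same images in each subset, whatever order the files are processed in.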

hermancollin assigned hermancollin, MurielleMardenli200 and edgark31 and unassigned hermancollin Sep 16, 2024
