Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Use the same train-val-test split #4

Open
hermancollin opened this issue Sep 6, 2024 · 1 comment
Open

Use the same train-val-test split #4

hermancollin opened this issue Sep 6, 2024 · 1 comment
Assignees

Comments

@hermancollin
Copy link
Member

In the preprocessing file, the data is first converted to YOLO format, then COCO. The problem is that the data is shuffled at both steps, resulting in random non-matching data splits. Both splits should be identical, otherwise how could we compare both methods???

Screenshot_20240906_105049

Screenshot_20240906_105118

There should be a JSON or YML file with a hardcoded data split. Both preprocess_data_coco() and preprocess_data_yolo() should split the data identically.

@hermancollin
Copy link
Member Author

We should hardcode the data split in a JSON file inside this repo. It should look like this:

{
   train: [
      "sub-rat1_sample-[...].png",
      "sub-rat2_sample-[...].png",
      [...],
   ],
   val: [
      [...]
   ],
   test: [
      [...]
   ]
}

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants