You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi, while training with custom fisheye dataset on a pretrained COCO checkpoint, I encounter some "index out of bounds" problems in rapid.py: here
I am running with CUDA_LAUNCH_BLOCKING=1 to get the exact line. The error is:
./aten/src/ATen/native/cuda/IndexKernel.cu:93: operator(): block: [0,0,0], thread: [0,0,0] Assertion -sizes[i] <= index && index < sizes[i] && "index out of bounds" failed.
Adding some prints before the error I can see that penalty_mask is of size: torch.Size([4, 3, 136, 136])
Looking some more into this, it seems that it happens when images from Wepdtof dataset are included in the training set. It does not happen for me, when I only include Habbof or Cepdof images. Maybe some unexpected or strange image resolution...
Hi, while training with custom fisheye dataset on a pretrained COCO checkpoint, I encounter some "index out of bounds" problems in rapid.py: here
I am running with CUDA_LAUNCH_BLOCKING=1 to get the exact line. The error is:
./aten/src/ATen/native/cuda/IndexKernel.cu:93: operator(): block: [0,0,0], thread: [0,0,0] Assertion
-sizes[i] <= index && index < sizes[i] && "index out of bounds"
failed.Adding some prints before the error I can see that penalty_mask is of size: torch.Size([4, 3, 136, 136])
Other values:
b: 2
best_n: tensor([0, 0, 0, 0, 2])
truth_j: tensor([141, 99, 47, 53, 97])
truth_i: tensor([ 81, 109, 86, 96, 108])
I don't seem able to find the root cause as for now, seems like it happens for some images only. Any ideas?
The text was updated successfully, but these errors were encountered: