-
Notifications
You must be signed in to change notification settings - Fork 145
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
RadixSortByByte issues in 1.1.1 #96
Comments
Hi icoderaven, If it's at all possible, would it be possible for you to put together a minimal repro case of the The 1.1.1 implementation of It might be possible to save some bits in Thanks! |
Gotcha! Thanks for the detailed explanation! Sorry for the tardy response, but since I'm a little hard pressed on time, I'm going to stick with CUDA 10.2 where everything works swimmingly for now and investigate this sometime later! |
Hi!
So I was testing out the library with CUDA 11, and my application starts failing within ActivateBricksGPU when calling the updated RadixSortByByte function at this line.
I get this error
Searching for this error on the internet seems to suggest issues with aligned kernel calls within thrust library. Switching back to 10.1 gives no issues.
NVIDIA/thrust#936
While I was at it, I realised that the new 1.1.1 implementation seems to use thrust and simply casts the indices to a 64 bit int to perform a (radix) sort on it? Is that correct/inefficient? The paper and the code clearly seems to suggest that the entries in the AUX_BRICK_LEVXYZ should be within very small bounds (specifically, level, which should be 0-5, and the X,Y, and Z values which should only be upto range_res[level]).
The text was updated successfully, but these errors were encountered: