-
Notifications
You must be signed in to change notification settings - Fork 921
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BUG] Segmentation fault in cudf::conditional_inner_join with Zero-Row Input #16066
Comments
@aocsa Thank you so much for the tests to reproduce this failure. I know that is a lot of work but it helps us tremendously. I will start investigating this. |
@aocsa I did some investigation. First, the tests seem to cover the empty case already: cudf/cpp/tests/join/conditional_join_tests.cu Lines 409 to 412 in b9a0b72
Also, the test cases you provided seem to have a bug. Note that the expected value in the snippet above ( If I fix your test cases to use For now, I have identified some improvements (additional missing test cases, etc.) and I am trying to track down how the test case you proposed (with |
I found a bug that would cause a segfault and have fixed it in #16094. When the right table has zero rows, conditional left anti-joins were returning a vector of indices containing garbage data. This is now corrected. That PR will be evaluated for inclusion in a 24.06.01 hotfix release with some other fixes -- to be determined. |
Closes rapidsai#16066. I found a bug that would cause the reported segfault and have fixed it in this PR. When the right table has zero rows, conditional left anti-joins were returning a vector of indices containing garbage data. Along the way, I refactored several parts of the conditional join tests and added coverage for more cases. Authors: - Bradley Dice (https://github.com/bdice) Approvers: - Nghia Truong (https://github.com/ttnghia) - Vyas Ramasubramani (https://github.com/vyasr) - Yunsong Wang (https://github.com/PointKernel) URL: rapidsai#16094
Describe the bug
There is a bug in cuDF where the cudf::conditional_inner_join operation triggers a Segmentation fault when one of the input values has num_rows = 0. This error occurs during the execution of the join operation.
Steps/Code to reproduce bug
Note: cudf::conditional_inner_join_size works fine, but cudf::conditional_inner_join ends with a segmentation fault.
Expected behavior
Return an empty DataFrame or a specific error.
Environment details
Method of cuDF install: source code
v24.06.00 branch release
The text was updated successfully, but these errors were encountered: