[BUG] Compute Sanitizer reports cudaErrorInvalidConfiguration error when writing ORC #13887
Labels
0 - Backlog
In queue waiting for assignment
bug
Something isn't working
cuIO
cuIO issue
libcudf
Affects libcudf (C++/CUDA) code.
Milestone
Describe the bug
I found the y dimension of dim3 is zero when writing ORC with all columns are non-String, details:
https://github.com/rapidsai/cudf/blob/v23.08.00/cpp/src/io/orc/dict_enc.cu#L68-L70
grid_size.y is 0 when I print it out. This means the num_str_cols is zero.
From ChatGPT: Each dimension (x, y, and z) in dim3 must have a value greater than zero.
The zero dimension casued the error:
Steps/Code to reproduce bug
The steps are in NVIDIA/spark-rapids-jni#1338.
And the steps are time consuming consider to the compilation.
You can review and check the cuDF code and I can verify this issue after a fix.
Please let me know If you have any problem in the reproduce steps.
Expected behavior
Fix the error.
It's better to check if this issue causes other problems.
Check if this issue impacts performance.
Environment overview (please complete the following information)
cuDF branch 23.10
Environment details
Additional context
Refer to JNI issue: NVIDIA/spark-rapids-jni#1338
If update the test case to write a ORC file with a String column, the y dimension is non-zero and error disappears.
Refer to:
cudf/java/src/test/java/ai/rapids/cudf/TableTest.java
Line 8610 in 1d58d5f
==>>
compute-saniziter is a tool can detect some GPU memory relevant issues
The text was updated successfully, but these errors were encountered: