Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[SPARK-50947][PYTHON][SQL][CONNECT] Assign appropriate error class and SparkException for duplicated artifacts #49598

Closed
wants to merge 2 commits into from

Conversation

itholic
Copy link
Contributor

@itholic itholic commented Jan 22, 2025

What changes were proposed in this pull request?

This PR proposes to assign appropriate error class and SparkException for duplicated artifacts.

Why are the changes needed?

To convert SparkConnectGrpcException into proper PySparkException so that we can ensure handling exceptions from Spark Connect Server properly.

Does this PR introduce any user-facing change?

No API changes, but the user-facing error message would be improved.

How was this patch tested?

Updated the existing test.

Was this patch authored or co-authored using generative AI tooling?

No.

@itholic itholic requested a review from HyukjinKwon January 22, 2025 06:52
@itholic
Copy link
Contributor Author

itholic commented Jan 22, 2025

Also cc @MaxGekk fyi

@itholic itholic changed the title [SPARK-50947][PYTHON][SQL][CONNECT] Assign appropriate error class for duplicated artifacts [SPARK-50947][PYTHON][SQL][CONNECT] Assign appropriate error class and SparkException for duplicated artifacts Jan 22, 2025
@MaxGekk
Copy link
Member

MaxGekk commented Jan 23, 2025

The test failure is not related to the changes:

[info] - SPARK-29442 Set `default` mode should override the existing mode *** FAILED *** (9 milliseconds)
[info]   java.lang.NoSuchFieldException: mode
[info]   at java.base/java.lang.Class.getDeclaredField(Class.java:2610)
[info]   at org.apache.spark.sql.DataFrameSuite.$anonfun$new$476(DataFrameSuite.scala:2061)
[info]   at org.scalatest.enablers.Timed$$anon$1.timeoutAfter(Timed.scala:127)

@MaxGekk
Copy link
Member

MaxGekk commented Jan 23, 2025

+1, LGTM. Merging to master/4.0.
Thank you, @itholic.

@MaxGekk MaxGekk closed this in 6e4240f Jan 23, 2025
MaxGekk pushed a commit that referenced this pull request Jan 23, 2025
…d SparkException for duplicated artifacts

### What changes were proposed in this pull request?

This PR proposes to assign appropriate error class and SparkException for duplicated artifacts.

### Why are the changes needed?

To convert SparkConnectGrpcException into proper PySparkException so that we can ensure handling exceptions from Spark Connect Server properly.

### Does this PR introduce _any_ user-facing change?

No API changes, but the user-facing error message would be improved.

### How was this patch tested?

Updated the existing test.

### Was this patch authored or co-authored using generative AI tooling?

No.

Closes #49598 from itholic/duplicated_artifact.

Authored-by: Haejoon Lee <[email protected]>
Signed-off-by: Max Gekk <[email protected]>
(cherry picked from commit 6e4240f)
Signed-off-by: Max Gekk <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants