Spark Connect adds another interaction approach, separating the client application from the Spark driver. Prior to Spark 4 the classic and Connect interfaces were separate; with the introduction of a shared base class, supporting Connect is now possible.
Pros:
Works on Spark 4
Opens up the possibility of Connect usage
Cons:
Possible performance hit: TransformingEncoders are required, so whole-stage codegen is no longer possible (impact unknown and probably not relevant)
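To make the performance concern above concrete, here is a minimal, dependency-free sketch of the "transforming" idea: a custom type is mapped to a representation that an existing encoder already handles. Every name in it (`Codec`, `UserId`, `UserIdCodec`, `transforming`) is a hypothetical stand-in chosen for illustration, not one of Spark's classes, and the "encoder" is modelled as a plain function to a `String`.

```scala
// Hypothetical stand-ins only: none of these names are Spark classes.
object CodecSketch {
  // A two-way mapping between a custom type I and a representation O
  // that a built-in encoder is assumed to handle already.
  trait Codec[I, O] {
    def encode(in: I): O
    def decode(out: O): I
  }

  // Example custom type with no built-in encoder of its own.
  final case class UserId(value: Long)

  // Codec that stores a UserId as a plain Long.
  object UserIdCodec extends Codec[UserId, Long] {
    def encode(in: UserId): Long = in.value
    def decode(out: Long): UserId = UserId(out)
  }

  // Wraps an "encoder" for O (modelled here as O => String) so it accepts I.
  // The codec call is an ordinary method invocation made once per value.
  def transforming[I, O](codec: Codec[I, O], underlying: O => String): I => String =
    in => underlying(codec.encode(in))

  def main(args: Array[String]): Unit = {
    val encodeLong: Long => String = _.toString
    val encodeUserId = transforming(UserIdCodec, encodeLong)
    println(encodeUserId(UserId(42L)))
    println(UserIdCodec.decode(UserIdCodec.encode(UserId(7L))))
  }
}
```

The per-value call into the codec is user code that a code generator may not be able to inline, which is the kind of cost the con above is worried about. Whether it matters in practice is, as noted, unknown.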
The below is no longer relevant, since this issue has been repurposed to focus on Spark 4.
Based on the source code, ConvertToArrow (called by Connect's SparkSession) and ExpressionEncoder only support the built-in AgnosticEncoders (logic present in ScalaReflection). It's a locked-in system, with no way to inject behaviour without classpath hackery.
So custom types, typed datasets (a different API) and injections, i.e. all the cool stuff, don't seem to be possible with Spark Connect as it stands in 3.4.