
[Performance Improvement] Support for AQE mode for delayed query pushdown for optimum runtime & improved debugging #535

Open
wants to merge 1 commit into base: master
Conversation

jalpan-randeri

Under adaptive query execution (AQE) mode, Spark overlaps the planning and execution phases, which causes Spark to run planning multiple times. The current implementation eagerly pushes the query down during the planning stage, which results in redundant query pushdowns to Snowflake and ignores filters discovered at runtime.

This commit handles that scenario by delaying the pushdown. This gives Spark AQE a chance to generate the most optimal plan and eliminates pushdown of redundant queries. Performance improves because new filters identified at runtime by AQE are also pushed down.

Furthermore, it logs the pushdown query into the Spark plan. This allows easy debugging from the Spark History Server, the UIs, and the logs.

This PR adds a new unit test suite for it.

This commit handles the scenario where Apache Spark runs under AQE
mode by having the Snowflake connector delay pushdown. This allows Spark
to generate a more optimal query plan, resulting in improved performance.

Furthermore, it logs the pushdown query into the Spark plan. This allows
easy debugging from the Spark History Server and UIs.
@urosstan-db

@jalpan-randeri Do we plan to merge this? It could also fix the following issue:
#567

@urosstan-db

@sfc-gh-bli Do we plan to merge this PR?

@jalpan-randeri
Author

Yes, I plan to merge this. However, I am waiting for a review of this PR. Can you review it?

@transient implicit private var data: Future[PushDownResult] = _
@transient implicit private val service: ExecutorService = Executors.newCachedThreadPool()

override protected def doPrepare(): Unit = {


What is the benefit of building the RDD in doPrepare instead of doExecute?

Author


The doPrepare method allows the Spark planner to perform the initial metadata collection work asynchronously, while doExecute is always a blocking call. By leveraging doPrepare we can move some work, such as building the SQL and creating the connection to Snowflake, into a background thread while the main planner operates on other nodes in the plan, giving some performance gains.
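The prepare/execute split described above can be sketched as follows. This is a minimal standalone illustration, not the connector's actual implementation: `PushDownResult`, `SnowflakePlanSketch`, and the generated SQL are hypothetical stand-ins, and Spark's `SparkPlan` lifecycle is simulated with plain method calls.

```scala
import java.util.concurrent.{Callable, ExecutorService, Executors, Future}

// Hypothetical stand-in for the result of building a pushdown query.
case class PushDownResult(sql: String)

// Sketch of a physical plan node that starts expensive pushdown work in
// doPrepare (asynchronously) and only blocks on it in doExecute, mirroring
// the prepare/execute lifecycle the comment describes.
class SnowflakePlanSketch(tableName: String) {
  @transient private var data: Future[PushDownResult] = _
  @transient private val service: ExecutorService = Executors.newCachedThreadPool()

  // Called before execution: submit the work to a background thread so the
  // planner can continue with other nodes in the plan.
  def doPrepare(): Unit = {
    data = service.submit(new Callable[PushDownResult] {
      override def call(): PushDownResult = {
        // e.g. build the pushdown SQL and open the Snowflake connection here
        PushDownResult(s"SELECT * FROM $tableName")
      }
    })
  }

  // Called when the rows are actually needed: block on the prepared result.
  def doExecute(): PushDownResult = {
    if (data == null) doPrepare() // defensive: prepare if it has not run yet
    val result = data.get()       // blocks until the background work finishes
    service.shutdown()
    result
  }
}
```

The gain comes from the window between doPrepare and doExecute: if the planner spends that time on other plan nodes, the connection setup and SQL generation have already completed by the time doExecute blocks on the future.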

@urosstan-db

> Yes, I plan to merge this. However, I am waiting for a review of this PR. Can you review it?

Overall, it looks good, but I am not a committer, so you need approval from someone from Snowflake.

@jalpan-randeri
Author

@sfc-gh-bli please review and share your thoughts.
