-
Notifications
You must be signed in to change notification settings - Fork 447
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[VL] Disable columnar table cache by default #3488
[VL] Disable columnar table cache by default #3488
Conversation
Thanks for opening a pull request! Could you open an issue for this pull request on Github Issues? https://github.com/oap-project/gluten/issues Then could you also rename commit message and pull request title in the following format?
See also: |
Run Gluten Clickhouse CI |
@ulysses-you, could you please review this PR? |
backends-velox/src/test/scala/io/glutenproject/execution/VeloxColumnarCacheSuite.scala
Outdated
Show resolved
Hide resolved
Run Gluten Clickhouse CI |
backends-velox/src/test/scala/io/glutenproject/execution/VeloxColumnarCacheSuite.scala
Outdated
Show resolved
Hide resolved
Run Gluten Clickhouse CI |
gluten-ut/spark33/src/test/scala/io/glutenproject/utils/velox/VeloxTestSettings.scala
Outdated
Show resolved
Hide resolved
Run Gluten Clickhouse CI |
backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/ListenerApiImpl.scala
Show resolved
Hide resolved
Run Gluten Clickhouse CI |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM if test pass
Run Gluten Clickhouse CI |
@gaoyangxiaozhu it seems you only change the ut of Spark-3.3. We should also change the ut of Spark-3.2 |
hey @ulysses-you i think |
I mean we lose the test coverage for Spark3.2 since this pr does not change the columnar cache config for Spark3.2. yes, the failed test is for Spark3.3, and it also passed in my local. I'm not sure why github action failed. Can you rebase the pr and push it again? thank you |
Run Gluten Clickhouse CI |
092913c
to
a52f7a3
Compare
Run Gluten Clickhouse CI |
Run Gluten Clickhouse CI |
@@ -27,6 +29,7 @@ class GlutenCachedTableSuite | |||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
One idea is to add config to system properties:
sys.props.put(GlutenConfig.COLUMNAR_TABLE_CACHE_ENABLED.key, "true")
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
can anyone from Gluten to help check why the test case fail in CI since we already enable the conf ?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The reason is that, Spark has a global cache for cache serializer. So in one jvm, if there are some table cache test suites which use the default cache serializer running before this test, then the default cache serializer had been cached. Therefore, we can not change the cache serializer to columnar for the specified test.
We can set this config by sys.props.put(GlutenConfig.COLUMNAR_TABLE_CACHE_ENABLED.key, "true")
to work around it
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
i have some issue to run total ut locally to verify.
@ulysses-you you mean do chang like below ?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
yes
Run Gluten Clickhouse CI |
1 similar comment
Run Gluten Clickhouse CI |
Run Gluten Clickhouse CI |
@@ -24,9 +26,11 @@ class GlutenCachedTableSuite | |||
extends CachedTableSuite | |||
with GlutenSQLTestsTrait | |||
with AdaptiveSparkPlanHelper { | |||
|
|||
// for temporarily disable the columnar table cache globally. | |||
sys.props.put(GlutenConfig.COLUMNAR_TABLE_CACHE_ENABLED.key, "true") |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
we should also do this for Spark3.4 which is merged recently
Run Gluten Clickhouse CI |
Run Gluten Clickhouse CI |
===== Performance report for TPCH SF2000 with Velox backend, for reference only ====
|
disable the columnar table cache in default
related issue 3456
How was this patch tested?
pass unit tests