-
Notifications
You must be signed in to change notification settings - Fork 447
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[VL] Link lib gluten to arrow's static libraries #6231
Conversation
PHILO-HE
commented
Jun 26, 2024
•
edited
Loading
edited
- Removes linking libvelox.so with arrow libs, which is useless.
- With static arrow libs linked, no need to keep lib arrow & lib parquet in Gluten Jar.
Thanks for opening a pull request! Could you open an issue for this pull request on Github Issues? https://github.com/apache/incubator-gluten/issues Then could you also rename commit message and pull request title in the following format?
See also: |
2c8732f
to
5b4dbdf
Compare
085a295
to
9d6654b
Compare
Perf regression observed |
/Benchmark Velox |
c0f1b69
to
3dc5daf
Compare
3dc5daf
to
72021cb
Compare
@FelixYBW, seems the regression is caused by other code or some other test factors. After code rebase, AWS TPCH's perf. is 1067.17s now. |
===== Performance report for TPCH SF2000 with Velox backend, for reference only ====
|
pushd $ARROW_PREFIX/cpp | ||
|
||
cmake_install \ | ||
-DARROW_PARQUET=ON \ | ||
-DARROW_FILESYSTEM=ON \ | ||
-DARROW_PROTOBUF_USE_SHARED=OFF \ | ||
-DARROW_DEPENDENCY_USE_SHARED=OFF \ | ||
-DARROW_DEPENDENCY_SOURCE=BUNDLED \ |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
-DARROW_DEPENDENCY_SOURCE=BUNDLED
does this change really necessary? some deps may already installed.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@Yohahaha, thanks for your comment. If not bundled, we need some extra code to support resolve those installed dependencies and link them. Considering arrow libs are generally built once then installed, currently we can simply use BUNDLED to get a lib of all arrow's bundled dependencies.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Considering arrow libs are generally built once then installed
it's important, hope we can reduce changes of arrow...
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@Yohahaha, I will take a look.
Observed some issues in dynamic build after this change
The build command used is
|
@PHILO-HE Update, if I change to |
@zhztheplayer, thanks for your feedback! I'll fix it. |
===== Performance report for TPCH SF2000 with Velox backend, for reference only ====
|
===== Performance report for TPCDS SF2000 with Velox backend, for reference only ====
|
@PHILO-HE can you confirm if BUILD_TYPE=relWithDebInfo is passed to Arrow when you build arrow? Looks the main branch doesn't pass it. |
@FelixYBW, currently, we always build arrow with Release type, assuming generally no need to align the build type with main project. Hongze has helped fixed some issues related to thrift. And now both arrow/velox can use system installed arrow libs. If you have any issue, please let me know. |