-
Notifications
You must be signed in to change notification settings - Fork 440
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat: use LazyTableProvider by default for write_to_deltalake for memory efficiency #3196
base: main
Are you sure you want to change the base?
Conversation
ACTION NEEDED delta-rs follows the Conventional Commits specification for release automation. The PR title and description are used as the merge commit message. Please update your PR title and description to match the specification. |
Codecov ReportAttention: Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## main #3196 +/- ##
==========================================
- Coverage 72.19% 72.13% -0.06%
==========================================
Files 138 139 +1
Lines 45292 45311 +19
Branches 45292 45311 +19
==========================================
- Hits 32697 32685 -12
- Misses 10532 10552 +20
- Partials 2063 2074 +11 ☔ View full report in Codecov by Sentry. |
6c2ff51
to
8f85a64
Compare
crates/core/src/operations/write.rs
Outdated
let table_schema = snapshot.input_schema()?; | ||
let plan_schema = plan.schema().as_arrow(); | ||
|
||
if table_schema.fields.len() != plan_schema.fields.len() { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think an not_eq comparison on the StructTypes might be better here?
8f85a64
to
dbaa4e1
Compare
…ory efficiency This defaults write_to_deltalake in Python to attempt to use the LazytableProvider for a more stream-like execution. It's currently opted out for schewma evolution since that's not supported by default. Some improvements in schema mismatch detection inside of the operations::write module are required as well Signed-off-by: R. Tyler Croy <[email protected]>
dbaa4e1
to
ba2845b
Compare
This defaults write_to_deltalake in Python to attempt to use the LazytableProvider for a more stream-like execution. It's currently opted out for schewma evolution since that's not supported by default.
Some improvements in schema mismatch detection inside of the operations::write module are required as well