-
Notifications
You must be signed in to change notification settings - Fork 1.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Minor: Use upstream concat_batches
from arrow-rs
#11615
Conversation
@ozankabak mentioned #11610 (comment) there was work in progress for |
batches.len(), | ||
row_count | ||
); | ||
arrow::compute::concat_batches(schema, batches) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The existing function simply wraps concat_batches
in arrow and logs some information which I don't think is critical. Note that the row_count
is only used for the logging. If reviewers think the trace!
is valuable, I will put it back (at the callsites)
@@ -364,7 +364,7 @@ async fn collect_left_input( | |||
let stream = merge.execute(0, context)?; | |||
|
|||
// Load all batches and count the rows | |||
let (batches, num_rows, metrics, mut reservation) = stream | |||
let (batches, _num_rows, metrics, mut reservation) = stream |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The calculation of _num_rows
can be removed here now...
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
good call -- will do as a follow on PR
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Which issue does this PR close?
N/A
Rationale for this change
While working on #11610 I found that there is a wrapper over
concat_batches
The concat batches function is now available in arrow-rs so let's use that
What changes are included in this PR?
Use arrow concat_batches directly rather than via a wrapper
Are these changes tested?
By existing CI
Are there any user-facing changes?
No