[ENH] maintain sorted array for conditional join #1398
🚀 Deployed on https://deploy-preview-1398--pyjanitor.netlify.app
Codecov Report
Attention: Patch coverage is
Additional details and impacted files
@@            Coverage Diff             @@
##              dev    #1398      +/-   ##
==========================================
- Coverage   89.07%   83.72%   -5.35%
==========================================
  Files          87       87
  Lines        5374     5857     +483
==========================================
+ Hits         4787     4904     +117
- Misses        587      953     +366
Wow, this one is a big PR to review. @samukweku I definitely haven't been able to find a good chunk of time to review it in detail. I'm gonna approve it knowing that the tests pass, and that's possibly the best I can do for now.
This is the third part of a series of PRs that ultimately adds support for aggregations within `conditional_join`. The numba code now uses an array that is kept sorted, based on grantjenks' sortedcontainers implementation. In one case, performance improved by about 50x compared to the current implementation; too large a perf diff?
PR Description
Please describe the changes proposed in the pull request:
`df` or `right` is empty
This PR relates to #1269, #1396 and #1397.
Please tag maintainers to review.
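For readers unfamiliar with the idea mentioned in the description, here is a minimal sketch of keeping a collection sorted as values arrive, so each lookup is a binary search rather than a fresh sort. It uses plain Python and the standard-library `bisect` module, not the numba code in this PR; the function name `running_count_le` and the sample data are hypothetical and chosen only for illustration.

```python
from bisect import bisect_right, insort


def running_count_le(values):
    """For each value, count how many earlier values are <= it.

    Hypothetical helper for illustration; not part of pyjanitor.
    """
    seen = []    # kept sorted at all times
    counts = []
    for value in values:
        # Binary search on the sorted list: O(log n) per lookup.
        counts.append(bisect_right(seen, value))
        # Insert at the sorted position, so no re-sort is ever needed.
        insort(seen, value)
    return counts, seen


counts, seen = running_count_le([5, 1, 4, 4, 9])
print(counts)  # [0, 0, 1, 2, 4]
print(seen)    # [1, 4, 4, 5, 9]
```

With a flat list, each insertion still shifts elements, which costs O(n). sortedcontainers reduces that cost by splitting the data into a list of smaller sublists; whether the numba code here does the same is not stated in this thread.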