added within_unsorted and within_count #52

abstractqqq · 2023-12-26T02:13:34Z

In some scientific computing tasks, we only need the neighbors within radius (with no requirement on being ordered), and there is no need to sort them, or we only need the neighbor count.

I implemented within_unsorted and within_count to deal with these situations more efficiently.

To check:
run tests and bench

abstractqqq · 2023-12-27T19:17:45Z

@mrhooray Sorry to mention you personally. I just want to make sure you see this PR.

mrhooray · 2024-01-02T04:23:05Z

src/kdtree.rs

+        Ok(evaluated.into_iter().map(Into::into).collect())
+    }
+
+    pub fn within_count<F>(&self, point: &[A], radius: A, distance: &F) -> Result<usize, ErrorKind>


Thanks for the PR @abstractqqq
within_count looks very similar to within_unsorted - do you think it's strictly needed?
It seems the same can be achieved via within_unsorted().len() or similar.

Yes you are right that it can be achieved like that. However, I don't think they compile to the same code, as the runtime has a difference. Doing into_iter will consume the heap and the extra step mapping HeapElement to a tuple still have small cost which will add up depending on heap size.

I am building an application in which one tree is built, and for each element in a column, I need to run a within_count query, and I care about every millisecond spent on this operation. As you can see in the screenshot above, when radius increases, the difference between the runtime gets larger.

From a UX point of view, having a within_count is also more convenient than doing within_unsorted().len().

If the difference comes from the final return, do you think it's worthwhile to extract L124-132/L144-152 to avoid future (unintended) divergence?

I factored out the common part as a function 👍

abstractqqq · 2024-01-17T02:41:57Z

@mrhooray

The lint failure is not a problem of this PR, but something in heap_element.rs. Could you please take a look? Thank you.

added within_unsorted within_count

2798eb9

mrhooray reviewed Jan 2, 2024

View reviewed changes

did cargo fmt

59237b5

mrhooray and others added 2 commits January 22, 2024 22:41

Merge branch 'master' into neighbor_count

c3bf7a4

better org

1584872

mrhooray merged commit 4f3755e into mrhooray:master Feb 2, 2024
3 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

added within_unsorted and within_count #52

added within_unsorted and within_count #52

abstractqqq commented Dec 26, 2023 •

edited

Loading

abstractqqq commented Dec 27, 2023

mrhooray Jan 2, 2024

abstractqqq Jan 2, 2024

mrhooray Jan 23, 2024

abstractqqq Feb 1, 2024

abstractqqq commented Jan 17, 2024

added within_unsorted and within_count #52

added within_unsorted and within_count #52

Conversation

abstractqqq commented Dec 26, 2023 • edited Loading

abstractqqq commented Dec 27, 2023

mrhooray Jan 2, 2024

Choose a reason for hiding this comment

abstractqqq Jan 2, 2024

Choose a reason for hiding this comment

mrhooray Jan 23, 2024

Choose a reason for hiding this comment

abstractqqq Feb 1, 2024

Choose a reason for hiding this comment

abstractqqq commented Jan 17, 2024

abstractqqq commented Dec 26, 2023 •

edited

Loading