-
Notifications
You must be signed in to change notification settings - Fork 83
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
sparse RangeSearch/AnnIterator to return raw distance #944
sparse RangeSearch/AnnIterator to return raw distance #944
Conversation
b35b63e
to
c53e0d0
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The idea is clear (return raw distances in case of the use of threshold for removing small values), but please consider altering names a bit in order to avoid the confusion while reading. I'd change quantization
to some other name.
@@ -502,6 +515,11 @@ class InvertedIndex : public BaseInvertedIndex<T> { | |||
return n_cols_internal(); | |||
} | |||
|
|||
[[nodiscard]] virtual bool | |||
HasQuantization() const override { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
According to what I understand, this is not Quantization
, this is Pruning
or maybe Whitening
. Correct me if I'm wrong, but this drop_during_build_
indicates whether a threshold is used for removing small values.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
changed to IsApproximated
. wanted to express whether the index is accurate kNN or ANN so not choosing a name that reflects how we do the approximation.
c53e0d0
to
67dbe9b
Compare
…be the raw instead of the quantized distance Signed-off-by: Buqian Zheng <[email protected]>
…read/write is needed Signed-off-by: Buqian Zheng <[email protected]>
037653d
to
b2e3baa
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
/lgtm
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: foxspy, zhengbuqian The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
Codecov ReportAll modified and coverable lines are covered by tests ✅
Additional details and impacted files@@ Coverage Diff @@
## main #944 +/- ##
=========================================
+ Coverage 0 74.22% +74.22%
=========================================
Files 0 82 +82
Lines 0 6615 +6615
=========================================
+ Hits 0 4910 +4910
- Misses 0 1705 +1705 |
* sparse: make the distance returned by RangeSearch and AnnIterator to be the raw instead of the quantized distance Signed-off-by: Buqian Zheng <[email protected]> * sparse: remove mutex in the index: we now use CC index if concurrent read/write is needed Signed-off-by: Buqian Zheng <[email protected]> --------- Signed-off-by: Buqian Zheng <[email protected]>
* sparse: make the distance returned by RangeSearch and AnnIterator to be the raw instead of the quantized distance Signed-off-by: Buqian Zheng <[email protected]> * sparse: remove mutex in the index: we now use CC index if concurrent read/write is needed Signed-off-by: Buqian Zheng <[email protected]> --------- Signed-off-by: Buqian Zheng <[email protected]> Signed-off-by: xianliang.li <[email protected]>
* update raft to 24.10 (#914) Signed-off-by: yusheng.ma <[email protected]> Signed-off-by: xianliang.li <[email protected]> * fix Index parameters handling and anniterator (#913) Signed-off-by: Alexandr Guzhva <[email protected]> Signed-off-by: xianliang.li <[email protected]> * add range check (#915) Signed-off-by: xianliang.li <[email protected]> * fix knowhere ut (#918) Signed-off-by: xianliang.li <[email protected]> * raft index supports cosine similarity by normalizing the input data. (#924) Signed-off-by: yusheng.ma <[email protected]> Signed-off-by: xianliang.li <[email protected]> * compensate for the missing acceleration functions in ARM NEON. (#922) Signed-off-by: yusheng.ma <[email protected]> Signed-off-by: xianliang.li <[email protected]> * improve sparse vector index mmap: to mmap almost everything (#928) Signed-off-by: Buqian Zheng <[email protected]> Signed-off-by: xianliang.li <[email protected]> * move sparse index Add to build pool (#933) Add SparseInvertedIndexNodeCC to allow being thread safe growing index Signed-off-by: Buqian Zheng <[email protected]> Signed-off-by: xianliang.li <[email protected]> * sparse mmap on disk (#935) Signed-off-by: Buqian Zheng <[email protected]> Signed-off-by: xianliang.li <[email protected]> * use MAP_PRIVATE for mmapped file (#938) Signed-off-by: Buqian Zheng <[email protected]> Signed-off-by: xianliang.li <[email protected]> * sparse RangeSearch/AnnIterator to return raw distance (#944) * sparse: make the distance returned by RangeSearch and AnnIterator to be the raw instead of the quantized distance Signed-off-by: Buqian Zheng <[email protected]> * sparse: remove mutex in the index: we now use CC index if concurrent read/write is needed Signed-off-by: Buqian Zheng <[email protected]> --------- Signed-off-by: Buqian Zheng <[email protected]> Signed-off-by: xianliang.li <[email protected]> * Add optimized distance functions for PowerPC (#894) Added the PowerPC vector functions in src/simd/distances_powerpc.cc, src/simd/distances_powerpc.h. The hooks to the PowerPC functions are added in src/simd/hook.cc. Signed-off-by: Carl Love <[email protected]> Co-authored-by: Carl Love <[email protected]> Signed-off-by: xianliang.li <[email protected]> * enhance: optimize get norms function (#950) Signed-off-by: cqy123456 <[email protected]> Signed-off-by: xianliang.li <[email protected]> --------- Signed-off-by: yusheng.ma <[email protected]> Signed-off-by: xianliang.li <[email protected]> Signed-off-by: Alexandr Guzhva <[email protected]> Signed-off-by: Buqian Zheng <[email protected]> Signed-off-by: Carl Love <[email protected]> Signed-off-by: cqy123456 <[email protected]> Co-authored-by: presburger <[email protected]> Co-authored-by: Alexander Guzhva <[email protected]> Co-authored-by: Buqian Zheng <[email protected]> Co-authored-by: carll99 <[email protected]> Co-authored-by: Carl Love <[email protected]> Co-authored-by: cqy123456 <[email protected]>
instead of the quantized distance
/kind improvement