-
Notifications
You must be signed in to change notification settings - Fork 12
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Can we add new domains into existing LSH indexers? #3
Comments
I am facing a similar issue. lsh = MinHashLSH(threshold=0.5, num_perm=128)
lsh.insert("m2", m2)
result = lsh.query(m1)
#Pickle lsh
#Unpickle lsh later
lsh.insert("m3", m3) #I can add more MinHash(es) later and then query
result = lsh.query(m1) But with I have a setup where I want to:
How can I do this please? @ekzhu |
You will need to create another index for your new records. The created index itself is frozen and can't be updated. |
The code snippet is from datasketch Python library. For this Go library, there isn't an update option. |
Hi, I've read the original great paper 👍 and the repo's readme.md. Now I have a question: Can I add new domain records into an existing indexer?
For example if I create an indexer with 1 billion records using
After the creation, I get 1 million new records again. Can I add them to the exist index_eqd? Or I can only create a new indexer with 1 billion + 1 million records.
The text was updated successfully, but these errors were encountered: