Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

疑问。 #5

Open
PapaMadeleine2022 opened this issue Jun 4, 2017 · 3 comments
Open

疑问。 #5

PapaMadeleine2022 opened this issue Jun 4, 2017 · 3 comments

Comments

@PapaMadeleine2022
Copy link

您好,这个想法是您自己的设计的? 还是哪篇论文的实现?

@zyymax
Copy link
Owner

zyymax commented Oct 18, 2017

simhash是来自google一篇论文,名字记不得了,你可以搜下

@arckalsun
Copy link

您好,如果有两篇文档A和B,A有200字。B有1000字,其中的200字是复制A的。用您的算法比对A和B,结果发现相似度很低。请问如何检测B抄袭了A?

@zyymax
Copy link
Owner

zyymax commented May 31, 2018

您好,这个工具不是用来查重的,只是用来对比全文相似性,不适合包含关系的两篇文档

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants