Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature] Gensim similarity text analysis #213

Open
sethwoodworth opened this issue Dec 19, 2012 · 0 comments
Open

[Feature] Gensim similarity text analysis #213

sethwoodworth opened this issue Dec 19, 2012 · 0 comments

Comments

@sethwoodworth
Copy link
Member

This is a big feature, and is listed here as a placeholder for the conversation of when or if to add it.


Gensim is a free Python framework designed to automatically extract
semantic topics from documents, as efficiently (computer-wise) and
painlessly (human-wise) as possible.
...
Once these statistical patterns are found, any plain text documents can
be succinctly expressed in the new, semantic representation, and
queried for topical similarity against other documents.

http://radimrehurek.com/gensim/intro.html

There is also a pre-packaged server implementation of the library, that looks like it would be ideal as a dedicated processing server for document's similarity.

https://github.com/piskvorky/gensim-simserver

It uses an extreme free software license, the AGPL

This means you may use simserver freely in your application (even
commercial application!), but you must then open-source your
application as well, under an AGPL-compatible license.

But luckily for us, our license is totally compatible with theirs.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant