Question about the readability module dependency: mozilla or luin? #14
Comments
Good question. I think I just took the first readability project that appeared in my npm search. In the meantime I have been trying out https://www.npmjs.com/package/article-parser, though I am still having some issues with it. Do you know the project? I understand you recommend using
?
No, I have never heard of it. I actually search GitHub more than the npm registry for this kind of thing. What I can tell you from my experience over the last few years is that many of these node modules or readability libraries make a big splash at the beginning, but end up totally abandoned and unmaintained after a few months, no matter whether it is Node, Go or Python. So I tend to watch out for the ones one can't trust in the long run.
I'm not a developer or programmer. And I wouldn't touch node even with a pole :-).
It seems before you had to do
But now you can do,
You can't do
Because someone else already took the name "readability".

I've been collecting and testing lots of these "readability" apps in node, python, etc. for the last year. Since the Firefox Reader Mode seems to do a good job, and Mozilla has lots of resources and developers, I always "assumed" that the Mozilla node one might be the best one to use. Because I don't understand node, I could never make a simple ready-made CLI tool out of the Mozilla library myself. That's why I looked for other ready-made ones, even though I don't like installing node on my system, and I hate having to pull dozens or hundreds of small node modules as dependencies.

But honestly, after having tested so many of these readability extractors, I think it doesn't make that big a difference at all. It depends very much on the website. Modern websites are so complicated. Sometimes even the simplest python script based on the original readability algorithm does the job acceptably.

Let me see if I can find more (node tools) in my notes:

Mozilla Readability based:
Not Mozilla Readability based:
A small question about the readability dependency. On the README.md page you write:
where "Readability" links to, https://github.com/mozilla/readability
But your project actually uses, https://github.com/luin/readability , which actually installs a module called "node-readability"
I know that "luin" is probably a fork of, or something pulling from, "mozilla".
I just wanted to make sure there is a reason for this, and for not pulling directly from mozilla.
I ask this because I've been testing dozens of node-based readability projects, and very frequently, because they choose to name their binary "readability", you end up with a mess of different packages whose installed binaries are all named "readability".
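To make the binary-name clash concrete, here is a minimal TypeScript sketch of how one could check a set of package manifests for commands claimed by more than one package. It relies only on the documented `bin` field of `package.json`, which is either a string (the command is then named after the package) or an object mapping command names to script paths. The `Manifest` type, the helper names and the sample manifests are all made up for illustration; they are not the metadata of any real readability package.

```typescript
// Hypothetical, simplified view of a package.json manifest.
interface Manifest {
  name: string;
  bin?: string | Record<string, string>;
}

// Return the command names a manifest would install.
function binNames(pkg: Manifest): string[] {
  if (pkg.bin === undefined) return [];
  if (typeof pkg.bin === "string") {
    // String form: the command is named after the package, without its scope.
    const slash = pkg.name.indexOf("/");
    const scoped = pkg.name.charAt(0) === "@" && slash !== -1;
    return [scoped ? pkg.name.slice(slash + 1) : pkg.name];
  }
  // Object form: every key is a command name.
  return Object.keys(pkg.bin);
}

// Map each command name claimed by more than one package to its claimants.
function findBinCollisions(pkgs: Manifest[]): Record<string, string[]> {
  const owners: Record<string, string[]> = Object.create(null);
  for (const pkg of pkgs) {
    for (const cmd of binNames(pkg)) {
      (owners[cmd] = owners[cmd] || []).push(pkg.name);
    }
  }
  const collisions: Record<string, string[]> = Object.create(null);
  for (const cmd of Object.keys(owners)) {
    const names = owners[cmd];
    if (names && names.length > 1) collisions[cmd] = names;
  }
  return collisions;
}

// Made-up manifests for illustration only.
const samples: Manifest[] = [
  { name: "example-reader-a", bin: { readability: "./cli.js" } },
  { name: "example-reader-b", bin: { readability: "./bin/run.js" } },
  { name: "example-reader-c", bin: "./index.js" },
];

console.log(findBinCollisions(samples));
```

In this made-up sample the first two packages both claim the `readability` command, which is the situation described above; the third uses the string form and so gets a command named after itself.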