-
Notifications
You must be signed in to change notification settings - Fork 350
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Feature: Please add plain text output functionality #150
Comments
I highly recommend using html2text library on the .summary() output for that.
given that it's that easy and that different people need different rendering options, and the options might change over time and I would need to reflect them in the library interface, I'd like to leave it as is. |
Shameless plug: trafilatura builds upon |
So I'll leave the issue opened until you decide whether you want to add it, right? |
Is there a plan to support |
Yes if many people want an easy way to have text output, I'll add it. |
Could you please support to get clear text content? |
Like .summary() but plain text instead of the .summary() html version.
E.g. as a new method or as an argument for the .summary() method.
That would be very useful for Natural Language Processing.
The text was updated successfully, but these errors were encountered: