
[FEATURE] save Unsupported_URLs as csv instead of txt and preserve old values #864

baccccccc opened this issue Mar 30, 2024 · 1 comment

baccccccc commented Mar 30, 2024

Is your feature request related to a problem? Please describe.
After I finish downloading a particular forum thread, I usually take some time to go through Unsupported_URLs.txt manually. For each link, I decide what to do. Often the solution is to open it in a browser and download it manually. Sometimes the link is simply not valuable or helpful.

This is, of course, time-consuming, and I would very much prefer not to have to do it again and again for links I have already investigated.

But at some point, I will re-attempt downloading the same forum thread, and there will be new entries in Unsupported_URLs.txt. I will try to investigate those as well, but it's hard to remember where I stopped last time and what I did with each link. So I end up either doubting myself or going through the same list from the top, re-checking the links I already investigated.

Describe the solution you'd like

  1. Save Unsupported_URLs as a CSV instead of a TXT file.
  2. Each time I do something with a particular link, I would add a free-form comment to a separate column in this file. (It can be anything, but think of examples such as “downloaded manually on yyyy-mm-dd” or “dead link”, etc.)
  3. Each time the script runs, it appends to the file instead of overwriting it (see the sketch after this list).
    3a. Each time the script encounters an unsupported URL, it should check whether there's already an entry for the same URL in the CSV.
    3b. If there's an existing entry for the same URL, keep it intact and do not add a new one. (More specifically, keep both the URL and whatever other columns might follow it.)
    3c. If no existing entry is found, append the URL at the end and leave the comment blank.
  4. Either way, the script should only care about the first column in the file (the URL) and ignore any other columns, since they may contain user comments.
  5. Under no circumstances should the script remove entries from this file. (Unless, I guess, a previously unsupported URL becomes supported and is downloaded successfully. Then it's fine to remove it.)
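
For illustration, here's a minimal sketch of what the merge logic in points 3–5 could look like in Python. The function name and file handling are just my assumptions for the example, not the project's actual code:

```python
import csv
from pathlib import Path

def update_unsupported_urls(csv_path: Path, new_urls: list[str]) -> None:
    """Merge newly encountered unsupported URLs into the CSV,
    preserving existing rows and any user comment columns."""
    rows: list[list[str]] = []
    seen: set[str] = set()

    if csv_path.exists():
        with csv_path.open(newline="", encoding="utf-8") as f:
            for row in csv.reader(f):
                if row:                  # skip blank lines
                    rows.append(row)     # keep the row exactly as the user left it
                    seen.add(row[0])     # only the first column (the URL) matters

    for url in new_urls:
        if url not in seen:              # 3b: existing entries stay intact
            rows.append([url, ""])       # 3c: new entry with a blank comment
            seen.add(url)

    with csv_path.open("w", newline="", encoding="utf-8") as f:
        csv.writer(f).writerows(rows)    # old rows first, new ones appended
```

Rewriting the whole file this way keeps the row order stable: existing rows (with their comments) come first, and new URLs are appended at the end with a blank comment column.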
baccccccc added the enhancement (New feature or request) label Mar 30, 2024
ClonedBoy commented
You can just import the TXT as a CSV and do it separately. What I usually do is create a separate file and then remove dups (with LibreOffice, for example). +1 for the appending option, but the other points defeat the purpose of a downloader vs. a dead-links tracker, imho. But if it's easy for the dev to code, then why not.
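
For example, a one-off conversion of the current TXT into a de-duplicated CSV (file names assumed from the issue) could look like this:

```python
import csv

# Read the current TXT output and de-duplicate while preserving order.
with open("Unsupported_URLs.txt", encoding="utf-8") as src:
    urls = dict.fromkeys(line.strip() for line in src if line.strip())

# Write a CSV with a blank second column left free for manual notes.
with open("Unsupported_URLs.csv", "w", newline="", encoding="utf-8") as dst:
    csv.writer(dst).writerows([url, ""] for url in urls)
```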
