Skip to content

Latest commit

 

History

History
507 lines (329 loc) · 18.4 KB

CHANGELOG.md

File metadata and controls

507 lines (329 loc) · 18.4 KB

CHANGELOG

All notable changes to this project will be documented here. For more details, visit the wiki: https://script-ware.gitbook.io/cyberdrop-dl

The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.

UNRELEASED

Added

  • members.luscious.net support
  • Coomer search results support

Fixed

  • bunkr reinforced link handling
  • Handle encoded URLS found while scraping (every crawler)

[6.2.0] - 2025-01-10

Added

  • PixHost.to support
  • Forum.AllPornComix.com support (forum)
  • Bellazon.com support (forum)
  • TitsInTops.com support (forum)
  • Video support for chevereto sites (imagepond.net, img.kiwi and jpg5.su)
  • Filesystem URLs support for images and videos (pixeldrain)
  • Indirect links support (saint)

Changed

  • Better MediaFire errors handling
  • Better RealDebrid errors handling
  • Do not overload 403 and 404 errors
  • Use canonical URLs for files in database (chevereto)

Fixed

  • Files being hashed twice when using Hashing.IN_PLACE
  • Post information parsing (reddit)
  • Sub-path parsing, ex: submitted, comments (reddit)
  • "Loose Files" folder not being created (all crawlers)
  • Delete startup log file if empty
  • Make a single GoFile token acquisition request
  • Make a single login request try per forum
  • DDoS-Guard detection

[6.1.1] - 2025-01-05

Changed

  • Remove crawlers scrape lock

Fixed

  • v5 to v6 transfer [database] (hash table missing on second startup)
  • v4 to v6 transfer [logs] ('downloader.log' being used by another process)

[6.1.0] - 2025-01-04

Fixed

  • Fallback to old file password check (cyberfile)
  • Handle stream redirects (bunkr)
  • Files being saved with the current date instead of the upload date
  • Sanitize filenames before downloading
  • Apprise notifications integration
  • Log startup errors to a file (validation, YAML, database and unknown errors)
  • Miscalculation of download and scrape queue length
  • Sorting videos with invalid or missing fps metadata
  • e-Hentai crawler

Changed

  • Set 512MB as the minimum required free space
  • --console-log-level will have no effect unless --no-ui was used
  • OS notification are enabled by default on Windows

Deprecated

  • --output-folder is deprecated. Use --download-folder
  • v4 support (importing v4 config and files) will be removed in a future version

Added

[6.0.1] - 2024-12-24

Fixed

  • Database v5 to v6 transfer

[6.0.0] - 2024-12-23

⚠️BREAKING CHANGES

  1. Stricter config validation logic. CDL will verify that every config value and CLI argument is valid before running
  1. All dedupe and hashing functions has been replaced.
  • xxh128 is now the default hashing algorithm, replacing md5
  • By default, all v5 hashes will be ignored for deduplication matches. You can re-enable md5 with --add-md5-hash
  • It's no longer possible to keep the newest file while deduplicating.
  • Every file previously downloaded is taken into account for deduplication. Files no longer need to currently exists on disk
  1. Removed --sort-cdl-only option
  1. The following arguments are deprecated and will be removed in a future version:
  • --output-folder
  • --download-all-configs
  • --sort-all-configs
  • --sort-all-downloads
  • --main-log-filename
  • --last-forum-post-filename
  • --unsupported-urls-filename
  • --download-error-urls-filename
  • --scrape-error-urls-filename

This update introduces the following changes:

  • Add Python 3.13 support
  • Add requests caching to speed up future download sessions for forums, Coomer, and Kemono.
  • Update Kemono & Coomer crawlers to use API calls
  • Update GoFile crawler to the new site design
  • Update Bunkr crawler to handle new UUID filename format
  • Rework entire deduplication and hashing logic. Now multiple hashing algorithms are supported
  • Use Pydantic for config and CLI values validation
  • Allow downloading duplicate files on the same run with --ignore-history
  • Fix sorting functionality
  • Allow running all configs sequentially using --config ALL
  • Better cookies management system
  • Added option to clear cookies for selected websites
  • General improvements and bug fixes

[5.7.2] - 2024-11-20

This update introduces the following changes:

  1. Add option to use cookies from any supported site
  2. Apply cookies from flaresolverr when possible, even if the response is invalid
  3. Add option to automatically import cookies at startup
  4. Better validation of config values
  5. Rework entire TUI user input options
  6. General logging improvements and bug fixes

Details

  • User can import cookies from their browser. CDL will use these cookies to login to websites and pass Cloudflare DDoS challenges. For more information on cookies extraction and configuration, visit: https://script-ware.gitbook.io/cyberdrop-dl/reference/configuration-options/settings/browser_cookies
  • When using flaresolverr, CDL will try to apply the cookies from the response and make a new request if necessary.
  • User can set CDL to automatically import cookies at startup. User must specify browser and domains to export cookies from
  • Add logic validation for config path values
  • Remove integrated config edit options. Modifications to the config must be done directly on the config file.

[5.7.1] - 2024-11-05

⚠️BREAKING CHANGES

All output files (except for the main log file) are now CSV files with headers for each column (Scrape_Errors, Download_Errors, Unsupported_URLs and Last_Forum_Post). A custom filename for each file can still be set via config, but the extensions will always be .csv

This update introduces the following changes:

  1. Adds the option to limit how many items are scraped
  2. Add support for scraping a users' coomer favorites
  3. Add integration to handle downloads supported by https://real-debrid.com
  4. Add support for https://nekohouse.su profiles and posts
  5. Add support for https://imagepond.net URLs
  6. Add support for password protected albums from Chevereto sites (jpg5, Img.kiwi and Imagepond)
  7. Show total runtime and total downloaded data on final report
  8. Add support to send the main log file as an attachment to the webhook_url report
  9. Add support to sent CDL report via email, telegram and many other services via Apprise
  10. Add support for % encoded URLs in the input file
  11. General logging improvements and bug fixes

Details:

  • Users can limit the number of items to scrape by type, using the --maximum-number-of-children parameter. For more details on how to use this feature, visit the wiki: https://script-ware.gitbook.io/cyberdrop-dl/reference/configuration-options/settings/download_options
  • Add support for scraping a users' coomer favorites by allowing the user to pass the coomer favorites page URL as an input URL (https://coomer.su/favorites). This requires them to have their coomer session token in the authentication.yaml file.
  • Add real-debrid integration to download from any site that they support (mega.nz,rapidgator, google drive, k2s, etc). User needs to provide their API key in the authentication.yaml file in order to allow downloads
  • Nekohouse URLs can now be scraped and downloaded by CDL
  • Users can now get the stats report of the run via multiple services and include the main log as an attachment. For more information on how to setup notifications, visit: https://script-ware.gitbook.io/cyberdrop-dl/reference/notifications
  • Fix parsing of bunkr file extensions when --remove-generated-id is enabled
  • Remove console markdown data from log files
  • Fix only_hosts skip logic
  • Better handling of some unknown errors

[5.7.0] - 2024-10-25

This update introduces the following changes:

  1. Rotating log files
  2. Overhaul hashing functions
  3. Add support for https://tokyomotion.net URLs
  4. Add support for https://xxxbunker.com URLs
  5. Add support for https://saint2.su albums
  6. Add support for password protected Cyberfile URLs
  7. Simplify some UI elements
  8. Improve jdownloader integration
  9. Implement rich logger
  10. Add a "Check for Updates" UI option
  11. General bug fixes

Details:

  • Add option to rotate log file names. If enabled, current date-time will be used as a suffix for each log file, in the format YYMMDD_HHMMSS. This will prevent overriding old log files
  • Refactor hashing functions and logic
  • Add support for videos, photos, albums, playlist, profiles and search results of tokyomotion.net
  • Add support for playlists, search results and video downloads on xxxbunker.com
  • Add support for saint2.su album URLs
  • Add support for both password protected files and folders on Cyberfile. Users can include the password as a query parameter in the input URL, adding ?password=<URL_PASSWORD> to it. Example: https://cyberfile.me/folder/xUGg?password=1234
  • Replace built-in log file handler with rich handler for better error reports
  • UI changes: remove redundant 'X of Y files' from every progress bar, sort scrape and download error by reverse frequency, use equal height for top row UI, fix padding issues, show unsupported URLs stats at the end
  • Add whitelist filter, autostart and custom download_dir options for jdownloader. For more details, visit the wiki: https://script-ware.gitbook.io/cyberdrop-dl/reference/configuration-options/settings/runtime_options
  • Added a "Check for Updates" UI option and improved the update check logic to check for new testing versions.
  • Fix error during program exit when referers table no longer exists
  • Prevents crashes when there are insufficient permissions to move a file
  • Fix an issue where CDL would delete URLs input file
  • Move functions for after download to post_runtime
  • Fix handling of deleted Imgbox albums if they return HTTP 200
  • Fix Cyberfile deleted folders not being correctly scraped
  • Update logging to include when a file is being downloaded with no crawler

[5.6.54] - 2024-10-21

This update introduces the following changes:

  1. Fix error on some bunkr videos

Details:

  • Fix error when downloading videos with no thumbnail (bunkr)
  • Update possible CDNs (bunkr)
  • Better error handling (bunkr)

[5.6.53] - 2024-10-20

This update introduces the following changes:

  1. Update bunkr crawler

Details:

  • Update bunkr crawler to work on the new site design

[5.6.52] - 2024-10-10

This update introduces the following changes:

  1. Fix scan_folder saved as invalid value

Details:

  • Fixes issue that causes the config file to be corrupted with an invalid scan_folder value.

[5.6.51] - 2024-10-10

This update introduces the following changes:

  1. Skip file download by referer
  2. Fixes album_id not been saved to database

Details:

  • Using the flag --skip-referer-seen-before will skip downloading files from any referer that have been scraped before. The file (s) will always be skipped regardless of whether the referer was successfully scraped or not
  • Fixes album_id property not being saved to database on supported crawlers

[5.6.50] - 2024-10-07

This update introduces the following changes:

  1. Support for password protected GoFile links

Details:

  1. Users can include the password as a query parameter in the input URL, adding ?password=<URL_PASSWORD> to it. Example: https://gofile.io/d/xUprGg?password=1234

[5.6.43] - 2024-10-03

This update introduces the following changes:

  1. Update True/False CLI arguments to integrate better with the config file.

Details:

  • CLI arguments that toggle settings to True or False can now be passed as either --arg or --no-arg to set the value to True or False respectively.
  • This also solves an issue where CLI arguments that toggle settings would override config file settings even if the CLI argument was never passed.

[5.6.42] - 2024-10-03

This update introduces the following changes:

  1. Filter final URL with --skip-hosts and --only-hosts arguments

Details:

  • This allows the user to skip or only download from specific bunkr hosts

[5.6.41] - 2024-10-01

This update introduces the following changes:

  1. Fixes crash when unsupported URLs have no parents
  2. Display new changelog if an update is available
  3. Updated supported sites in Wiki

Details:

  • Fixes crash if an unsupported url have no parents
  • Always display an updated changelog if a new version has been released on pypi
  • Remove SimpCity from supported websites on Wiki

[5.6.40] - 2024-10-01

This update introduces the following changes:

  1. Fixes empty folder cleanup

Details:

  • Fix incorrect path objects on post-runtime folder cleanup

[5.6.39] - 2024-09-30

This update introduces the following changes:

  1. Add external CHANGELOG file

Details:

[5.6.38] - 2024-09-30

This update introduces the following changes:

  1. Fix scrape_items creation for Kemono and coomer links

Details:

  • Fixes parents tracking for Kemono and coomer links

[5.6.37] - 2024-09-30

This update introduces the following changes:

  1. Fixes empty folder cleanup on python 3.11

Details:

  • Fixes logic by walking the directory tree using os.walk to remain compatibility with python 3.11

[5.6.36] - 2024-09-30

This update introduces the following changes:

  1. Delete empty files after a successful run.
  2. Added a feature to save the origin of unsupported URLs

Details:

  • Empty files (0 bytes) inside download_dir will be deleted alongside empty folders after a successful run
  • Each unsupported URL will now be saved alongside the URL of the original item they came from (Unsupported_URLs.txt)
  • Origin is also saved for password protected links, allowing the user to visit the URL (ex. forum post) and retrieve the password if available

[5.6.35] - 2024-09-30

This update introduces the following changes:

  1. Small fixes for sorting system

Details:

  • Fixes scan_dir selection logic

[5.6.34] - 2024-09-30

This update introduces the following changes:

  1. Added a feature to fix the names of multipart archives.

Details:

  • Multipart archives will be renamed to have the proper naming format when the --remove-generated-id-from-filenames argument is passed.

[5.6.33] - 2024-09-25

This update introduces the following changes:

  1. Fix issues with checking how much free space is available on the disk.
  2. Skip clearing the console when running with the --no-ui flag.

Details:

  • Fixed an issue with free space not being properly checked when running with the --retry-failed flag.
  • Made error output more clear when there is not enough free space to download a file.
  • Skip clearing the console when running with the --no-ui flag to allow users to see the output of all runs done with --no-ui.

[5.6.32] - 2024-09-22

This update introduces the following changes:

  1. Add new URLs categorization feature for the URLs.txt file.

Details:

  • You can now group links under one download folder by adding a category name above the links in the URLs.txt file.
  • The category name must be prefixed by three dashes (---) and must be on a new line.
  • The category name will be used as the folder name for the links that follow it.
  • To end a category, add three dashes (---) on a new line after the links.
  • You can have multiple categories in the URLs.txt file, and the links will be grouped accordingly.

For more details, visit the wiki: https://script-ware.gitbook.io/cyberdrop-dl/reference/configuration-options/settings/sorting

[5.6.30] - 2024-09-21

SIMPCITY HAS BEEN REMOVED FROM THE SUPPORTED WEBSITES LIST.

TO SEE WHY, VISIT THE WIKI: https://script-ware.gitbook.io/cyberdrop-dl/simpcity-support-dropped

[5.6.20] - 2024-09-19

This update introduces the following changes:

  1. Ability to scrape URLs from PixelDrain text post

Details:

  • Cyberdrop-DL will now scrape URLs from PixelDrain text posts and make a folder for all the URLs within the post, reducing clutter.

[5.6.13] - 2024-09-19

This update introduces the following changes:

  1. Per-config authentication settings

Details:

  • If an authentication.yaml file is placed within your config directory, it will be used instead of the global authentication values.

[5.6.12] - 2024-09-19

This update introduces the following changes:

  1. Reformat code and organize imports

Details:

  • Reformatted code to be more readable and removed unused imports.

[5.6.11] - 2024-09-16

This update introduces the following changes:

  1. Detect and raise an error for private GoFile folders

Details:

  • Private GoFile folders will now raise an error when attempting to download them instead of crashing CDL

[5.6.1] - 2024-09-13

This update introduces the following changes:

  1. Fixes issue with --sort-all-downloads
  2. Improves sort status visibility

Details:

  • The sort status is now display under hash, along with other statuses
  • --sort-all-downloads is disabled by default, thus only cdl downloads are sorted without the flag
  • sort_folder can not be the same as the scan_dir

[5.6.0] - 2024-09-13

This update introduces the following changes:

  1. Updated the sorting progress UI to display more information.
  2. Removed unused functions from progress bars.

Details:

  • The sorting UI now displays the progress of each folder as it is being processed, including the number of files that have been sorted and the percentage of the folder that has been processed.
  • The sorting UI now also shows what folders are in the queue to be sorted.

[5.5.1] - 2024-09-12

This update introduces the following changes:

  1. Small fixes for sorting system

Details:

  • use - instead of _ for new arguments
  • fix bug where purge_dir is called for each file, instead of each directory when done

[5.5.0] - 2024-09-12

This update introduces the following changes:

  1. Finalizes new sorting feature
  2. add scanning directory for sorting
  3. adds progress bar for sorting

Details:

  • skips need to scan database if sort_cdl_only is false
  • progress bar for current progress of sorting files, incremented for each folder
  • allow for setting a different folder to scan that is independent of the download folder