Skip to content

HTTP, SOCKS4, SOCKS5 proxies scraper and checker with rich functionality.

License

Notifications You must be signed in to change notification settings

Und3r1ine/proxy-scraper-checker

 
 

Repository files navigation

proxy-scraper-checker

Screenshot

HTTP, SOCKS4, SOCKS5 proxies scraper and checker.

  • Asynchronous.
  • Uses regex to search for proxies (ip:port format) on a web page, allowing proxies to be extracted even from json without making changes to the code.
  • It is possible to specify the URL to which to send a request to check the proxy.
  • Can sort proxies by speed.
  • Supports determining the geolocation of the proxy exit node.
  • Can determine if the proxy is anonymous.

You can get proxies obtained using this script in monosans/proxy-list.

Installation and usage

  • Download and unpack the archive with the program.
  • Edit config.ini according to your preference.
  • Install Python (minimum supported version is 3.7). During installation, be sure to check the box Add Python to PATH.
  • Install dependencies and run the script. There are 2 ways to do this:
    • Automatic:
      • On Windows run start.cmd
      • On Unix-like OS run start.sh
    • Manual:
      1. cd into the unpacked folder
      2. Install dependencies with the command python -m pip install -U --no-cache-dir --disable-pip-version-check pip setuptools wheel; python -m pip install -U --no-cache-dir --disable-pip-version-check -r requirements.txt
      3. Run with the command python -m proxy_scraper_checker

Folders description

When the script finishes running, the following folders will be created (this behavior can be changed in the config):

  • proxies - proxies with any anonymity level.
  • proxies_anonymous - anonymous proxies.
  • proxies_geolocation - same as proxies, but includes exit-node's geolocation.
  • proxies_geolocation_anonymous - same as proxies_anonymous, but includes exit-node's geolocation.

Geolocation format is ip:port|Country|Region|City.

Buy me a coffee

Ask for details on Telegram or VK.

License

MIT

About

HTTP, SOCKS4, SOCKS5 proxies scraper and checker with rich functionality.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 98.0%
  • Other 2.0%