Skip to content

Latest commit

 

History

History
20 lines (14 loc) · 888 Bytes

README.md

File metadata and controls

20 lines (14 loc) · 888 Bytes

scrapy-reddit

This is a scrapy script that scrapes your saved, upvoated and other posts. By default the script stores the results in a csv file, but it can be easily changed to store them anywhere you want.

Usage

  • install requirements.txt
  • first you must create an app in https://www.reddit.com/prefs/apps/
  • add the client id and secret in settings.py or with env variables
  • add your reddit credentials in settings.py or with env variables
  • run the command scrapy crawl reddit while inside the project
  • check the results in path CSV_PATH (settings.py)

Run the tests with python3 tests.py

Extending the script

If you wish to do something else with the scraped items you can easily create a new middleware for the script, for more info on scrapy middleware check: https://docs.scrapy.org/en/latest/topics/downloader-middleware.html