Skip to content

Scrapy program that scrapes your saved, upvoad, .. posts and comments

Notifications You must be signed in to change notification settings

jurgonaut/scrapy-reddit

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

34 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

scrapy-reddit

This is a scrapy script that scrapes your saved, upvoated and other posts. By default the script stores the results in a csv file, but it can be easily changed to store them anywhere you want.

Usage

  • install requirements.txt
  • first you must create an app in https://www.reddit.com/prefs/apps/
  • add the client id and secret in settings.py or with env variables
  • add your reddit credentials in settings.py or with env variables
  • run the command scrapy crawl reddit while inside the project
  • check the results in path CSV_PATH (settings.py)

Run the tests with python3 tests.py

Extending the script

If you wish to do something else with the scraped items you can easily create a new middleware for the script, for more info on scrapy middleware check: https://docs.scrapy.org/en/latest/topics/downloader-middleware.html

About

Scrapy program that scrapes your saved, upvoad, .. posts and comments

Resources

Stars

Watchers

Forks

Packages

No packages published

Languages