Scraping

About this project

A scraper for one of Sweden's premier legal to scrape PDF URLs and metadata from this website https://www.domstol.se/hogsta-domstolen/avgoranden/.

The scraper navigates to the specified URL, applies filters, and extracts PDF URLs from the page source.
It then fetches metadata for each PDF and saves it to metadata.json and metadata.csv.
Latest 10 PDFs are downloaded to the specified directory.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
Scraper		Scraper
helpers		helpers
.gitignore		.gitignore
Makefile		Makefile
README.md		README.md
config.py		config.py
env.templaate		env.templaate
exceptions.py		exceptions.py
requirements.txt		requirements.txt
run_dev.py		run_dev.py