An automated scraper built with Scrapy that collects result data from the Zevenheuvelenloop running event. The data is scraped from http://evenementen.uitslagen.nl/.
The scraping process works as follows:
- The allowed domain is set to evenementen.uitslagen.nl.
- Change the max_id value to the number of result pages you want scraped.
- An example of a final URL is ~uitslag01233.html.
- The scraper visits the configured number of pages and writes every result row to the CSV file until done (a sketch of such a spider follows this list).
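A minimal sketch of what such a spider could look like is shown below. The exact URL path, the year, the column order, and the CSS selectors are assumptions for illustration and will not match the actual ZevenHeuvelSpider_spider.py exactly.

```python
import scrapy


class ZevenHeuvelSpider(scrapy.Spider):
    name = "zevenheuvel"
    allowed_domains = ["evenementen.uitslagen.nl"]
    max_id = 100  # set this to the number of result pages you want scraped

    def start_requests(self):
        # Result pages are numbered sequentially, e.g. ...uitslag01233.html,
        # so the spider simply walks the ids from 1 up to max_id.
        for page_id in range(1, self.max_id + 1):
            url = (
                "http://evenementen.uitslagen.nl/2019/zevenheuvelenloop/"
                f"uitslag{page_id:05d}.html"  # hypothetical path and year
            )
            yield scrapy.Request(url, callback=self.parse)

    def parse(self, response):
        # Every data row of the results table becomes one CSV record.
        # Column layout is an assumption; adjust to the real page structure.
        for row in response.css("table tr"):
            cells = [c.strip() for c in row.css("td::text").getall()]
            if len(cells) >= 5:  # skip header and empty rows
                yield {
                    "position": cells[0],
                    "name": cells[1],
                    "category": cells[2],
                    "gun_time": cells[3],
                    "chip_time": cells[4],
                }
```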
After competing in the event myself, I wanted answers to some questions based on the race data, such as:
- What was the average finish time of a given category?
- How many people did a runner overtake within their own category?
- How many people did a runner overtake across all categories?
- And so on.
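To give an idea of how such questions could be answered from the scraped CSV, here is a small pandas sketch. The column names (name, category, gun_time, chip_time) follow the spider sketch above and are assumptions, as are the runner name and category used in the example.

```python
import pandas as pd

# Load the scraped results; column names are assumptions, not guaranteed
# to match the real CSV produced by the spider.
df = pd.read_csv("zevenheuvel.csv")

# Convert "H:MM:SS" time strings to seconds for arithmetic.
for col in ("gun_time", "chip_time"):
    df[col + "_s"] = pd.to_timedelta(df[col]).dt.total_seconds()

# Average finish (chip) time per category.
avg_per_category = df.groupby("category")["chip_time_s"].mean()
print(avg_per_category.apply(lambda s: pd.to_timedelta(s, unit="s")))


# Rough overtaking estimate: anyone who crossed the start line earlier
# (smaller gun-minus-chip offset) but reached the finish later (larger
# gun time) must have been passed somewhere on the course.
def overtaken(frame, runner_name):
    frame = frame.assign(start_offset=frame["gun_time_s"] - frame["chip_time_s"])
    me = frame[frame["name"] == runner_name].iloc[0]
    passed = frame[(frame["start_offset"] < me["start_offset"])
                   & (frame["gun_time_s"] > me["gun_time_s"])]
    return len(passed)


# "Some Runner" and the "M40" category are placeholders.
print(overtaken(df, "Some Runner"))                           # across all categories
print(overtaken(df[df["category"] == "M40"], "Some Runner"))  # within one category
```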
Run the scraper with the following command:
scrapy runspider ZevenHeuvelSpider_spider.py -o zevenheuvel.csv
After completion, open zevenheuvel.csv to inspect the results.
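For a quick look at the output you could, for instance, load the file with pandas (the column names depend on what the spider yields):

```python
import pandas as pd

# Quick inspection of the scraped results.
df = pd.read_csv("zevenheuvel.csv")
print(df.head())
print(f"{len(df)} rows scraped")
```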