Skip to content
Swati Jaiswal edited this page Sep 16, 2018 · 2 revisions

Welcome to the cdl-trial-project wiki!

Organisation of the Repository:

scraper

Contains the scraping scripts including the spiders which scrapes the datasets and creates CSVs and utils for extra stuff like cleaning old files to get them into proper format to merge with new files collected with different queries.

analysis

Contains the notebook modules which have plotting functions. Also a utils and wrangler module which wrangles the data and provides utilities for plots.

reports

Contains the analysis reports.

datasets

Contains the raw datasets that were scraped.

munged_datasets

Contains the munged datasets.

Clone this wiki locally