URL = http://ceomeghalaya.nic.in/erolls/erolldetails.html
Year = Draft Roll for 2018
conda env create -f tools/environment.yml
to install working environment andsource activate erolls
- Or,
pip install -r requirements.txt
if not using a conda environment tools/utils.py
is a helper function for downloading files, and sanity checkspython meghalaya.py
to downloads all the pdfs to directory../data/Meghalaya/
and creates 'Meghalaya.txt' for files that were not downloaded successfullypython meghalaya_retry.py
for retrying downloads for files in 'Meghalaya.txt'python meghalaya_SanityCheck.py
for doing a sanity check on the files downloaded
- Total Number of files = 3038
- The downloaded files are of form A{District number}_A{District number}{assembly constituency number}.pdf
- Files not available can be found in 'Meghalaya3.txt''