electoral_rolls/punjab at master · in-rolls/electoral_rolls

readme.md

Total Number of Files = 22,883

The Script does three things:

Produces punjab.csv that contains metadata about the pdfs. The CSV has the following fields: district_name, assembly_constituency, ero (intermediate HTML table), mla (intermediate HTML table), mp (intermediate HTML table), total_voters (intermediate HTML table), part_no (from the final HTML table), area_covered (again final HTML table), polling_station_building (final HTML table), filename
Downloads intermediate HTML files
Downloads all the pdfs to a directory called punjab_pdfs/

pip install -r requirements.txt
python punjab.py