We scrape electoral rolls from https://ceouttarpradesh.nic.in/rollpdf/rollpdf.aspx
We split scraping into two steps:
- harvesting all the links --- CSV with links to the pdf is here
- downloading the pdfs
The PDFs are posted to GCS on archival storage and available under requester pays. The PDFs are over 600 GB.