ScraperWiki PDF Tables tool #37

Open
mkalish opened this issue Jan 18, 2017 · 4 comments

Comments

@mkalish
Member

mkalish commented Jan 18, 2017

https://pdftables.com/

This tool is no longer available for free, but the code is open source. The old tool could export data directly to CKAN, which would be an extremely helpful feature for adding data to the portal.

Tasks:

  • Run LMGTDFY on an EC2 instance and share its public DNS name
  • Investigate using the above tool to directly import data into CKAN
  • Investigate running the ScraperWiki PDF tool ourselves, or document what would be involved in writing our own
  • Look into Tabula as a potential alternative for scraping data
  • ...
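
For the CKAN import task, here is a rough sketch of what pushing an extracted table into the portal might look like. It assumes the portal has CKAN's DataStore extension enabled and that the PDF table has already been exported to CSV by whichever tool we pick. The function names, the `ckan.example.org` URL, the dataset id and the API key are all placeholders for illustration, not anything from the old PDF Tables exporter.

```python
import csv
import io
import json
import urllib.request


def csv_to_records(csv_text):
    """Parse CSV text (e.g. a table exported from a PDF) into a list of dicts."""
    return list(csv.DictReader(io.StringIO(csv_text)))


def build_datastore_request(ckan_url, api_key, package_id, name, records):
    """Build, but do not send, a POST to CKAN's datastore_create action.

    Assumption: the DataStore extension is enabled on the target portal.
    Passing "resource" asks CKAN to create a new resource in the dataset
    identified by package_id and load the records into it.
    """
    payload = {
        "resource": {"package_id": package_id, "name": name},
        "records": records,
    }
    return urllib.request.Request(
        url=ckan_url.rstrip("/") + "/api/3/action/datastore_create",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json", "Authorization": api_key},
        method="POST",
    )


# Hypothetical usage (placeholder URL, key and dataset id):
#   records = csv_to_records(open("table.csv").read())
#   req = build_datastore_request("https://ckan.example.org", "API-KEY",
#                                 "my-dataset", "Table from PDF", records)
#   urllib.request.urlopen(req)  # actually sends the request
```

Keeping the request building separate from the sending means we can check the payload without touching a live portal. All values go in as strings here; column types would need to be declared through the `fields` parameter if we care about them.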
@jalbertbowden
Collaborator

another open source option as well
https://lmgtdfy.usopendata.org/

@jalbertbowden
Collaborator

this tool scrapes ESRI REST services https://github.com/openaddresses/pyesridump
