-
Notifications
You must be signed in to change notification settings - Fork 5
Monthly URI Testing
On a Monthly basis, we perform URI link-checking to ensure the data in the Geoportal stays current. Each record or document in the Geoportal can contain multiple URIs: for metadata representations, for object downloads, for IIIF tiles, etc. Currently, ~16,000 docs contain roughly ~48,000 URIs.
- Check prior states
RAILS_ENV=production bundle exec rake geoportal:uri_states
- Purge URIs
RAILS_ENV=production bundle exec rake geoportal:uri_purge
- Process ALL URIs
RAILS_ENV=production bundle exec rake geoportal:uri_process_all
- Check run states
RAILS_ENV=production bundle exec rake geoportal:uri_states
- Re-run incomplete states
RAILS_ENV=production bundle exec rake geoportal:uri_queue_incomplete_states
Check that all background jobs have completed. When enqueued is 0 everything has processed.
RAILS_ENV=production bundle exec rake geoportal:sidekiq_stats
- Final run states
RAILS_ENV=production bundle exec rake geoportal:uri_states
- Produce report
RAILS_ENV=production bundle exec rake geoportal:uri_report
Download at something like: http://geo.btaa.org/2018-08-22_09-57-37.uri_report.csv
At the completion of the task, you'll have a "results.csv" file containing ~15,000 URIs and their result status. Each time I create a new Google Spreadsheet, and import this csv file, to share the data with the BTAA Geoportal folk.
They prefer to see the results in a pivot table. Here are past example spreadsheets to see: