Skip to content
Change the repository type filter

All

    Repositories list

    • Shell
      1000Updated Dec 11, 2024Dec 11, 2024
    • Deployment configuration for all UKWA services stacks.
      Python
      Apache License 2.0
      55410Updated Dec 6, 2024Dec 6, 2024
    • The UKWA Heritrix3 custom modules and Docker builder.
      Java
      710451Updated Dec 2, 2024Dec 2, 2024
    • Secured browser for accessing NPLD content in Legal Deposit Library reading rooms.
      TypeScript
      MIT License
      1030Updated Sep 27, 2024Sep 27, 2024
    • Dashboard and monitoring system for the UK Web Archive
      Python
      5080Updated Sep 26, 2024Sep 26, 2024
    • Shell
      0000Updated Aug 29, 2024Aug 29, 2024
    • Shell
      0000Updated Aug 12, 2024Aug 12, 2024
    • WARC and ARC indexing and discovery tools.
      Java
      25117933Updated Aug 9, 2024Aug 9, 2024
    • Shell
      0000Updated Aug 1, 2024Aug 1, 2024
    • Shell
      0000Updated Aug 1, 2024Aug 1, 2024
    • Tools for working with UKWA crawler event streams
      Python
      Apache License 2.0
      3055Updated Jul 6, 2024Jul 6, 2024
    • crawl-db

      Public
      A standalone database for crawl events.
      Python
      Apache License 2.0
      1104Updated Jul 5, 2024Jul 5, 2024
    • Static basic UKWA website
      HTML
      1000Updated Jul 4, 2024Jul 4, 2024
    • ClamD in a container
      Dockerfile
      Apache License 2.0
      2000Updated May 20, 2024May 20, 2024
    • Apache Airflow with a few additional dependencies
      Dockerfile
      Apache License 2.0
      2000Updated Apr 25, 2024Apr 25, 2024
    • A Dockerised Robot Framework execution environment.
      RobotFramework
      Apache License 2.0
      2040Updated Apr 24, 2024Apr 24, 2024
    • Hadoop running in a container.
      Dockerfile
      Apache License 2.0
      2100Updated Apr 24, 2024Apr 24, 2024
    • An acid test suite for crawlers.
      PHP
      3470Updated Apr 24, 2024Apr 24, 2024
    • Python clients for W3ACT and Heritrix3
      Python
      30132Updated Apr 19, 2024Apr 19, 2024
    • Web page rendering service based on Google's Puppeteer
      JavaScript
      3212Updated Apr 11, 2024Apr 11, 2024
    • Containerised version of the Solr service used to generate the UKWA UI collections browser
      Python
      3001Updated Apr 11, 2024Apr 11, 2024
    • ukwa-site

      Public
      Using static site generation for parts of the our site.
      SCSS
      Apache License 2.0
      40121Updated Apr 11, 2024Apr 11, 2024
    • ukwa-ui

      Public
      A new user interface for the UK Web Archive
      Java
      BSD 3-Clause "New" or "Revised" License
      601183Updated Apr 10, 2024Apr 10, 2024
    • UKWA web apps for working with internal APIs, build on Jupyter notebooks and Voila.
      Jupyter Notebook
      2006Updated Jan 31, 2024Jan 31, 2024
    • A simple web service for viewing crawl logs.
      Python
      Apache License 2.0
      2013Updated Jan 30, 2024Jan 30, 2024
    • backstage

      Public
      UI for searching across internal services
      Ruby
      2017Updated Jan 30, 2024Jan 30, 2024
    • Run pdf2htmlEX in a Docker container.
      Python
      Apache License 2.0
      102410Updated Jan 30, 2024Jan 30, 2024
    • w3act

      Public
      w3act is an annotation and curation tool for building web archive collections
      Java
      Apache License 2.0
      619650Updated Jan 30, 2024Jan 30, 2024
    • Dockerized Apache Superset including Solr module
      Shell
      1010Updated Nov 16, 2023Nov 16, 2023
    • kevals

      Public
      Key-values data aggregator
      Python
      Apache License 2.0
      1000Updated Nov 16, 2023Nov 16, 2023