Skip to content
@impresso

Media Monitoring of the Past

Media Monitoring of the Past - Beyond Borders: Connecting Historical Newspapers and Radio.

Impresso Project Logo

About

Hi there 👋 !

Impresso - Media Monitoring of the Past is an interdisciplinary research project that uses machine learning to pursue a paradigm shift in the processing, semantic enrichment, representation, exploration and study of historical media across modalities, temporal, linguistic, and national borders. The project has received two rounds of funding, from 2017-2020 and 2023-2027 (hence, there is code from both periods).

We design and develop the Impresso Web App and the upcoming Impresso Datalab (coming soon), while conducting research at the intersection of Natural Language Processing, Design, and History. Find more details on the project website.

Contents

This GitHub organization hosts numerous repositories dedicated to:

  • the code behind the Web App and Datalab. While a few repositories are public, many are still private. We aim to document and release code properly as it matures and becomes ready;
  • code supporting research efforts;
  • code from student projects.

More information and highlights will be shared as we continue to make progress! In addition to the public repositories listed below, you can also check out our models on the Impresso Hugging Face organisation.

Impresso 2 release history

(to come)

Popular repositories Loading

  1. named-entity-tutorial-dh2019 named-entity-tutorial-dh2019 Public

    Tutorial on NE processing for Digital Humanities - DH Utrech 2019

    Jupyter Notebook 25 4

  2. CLEF-HIPE-2020 CLEF-HIPE-2020 Public

    Identifying Historical People, Places and other Entities: Shared Task on Named Entity Recognition and Linking on Historical Newspapers at CLEF 2020.

    SCSS 22 5

  3. NZZ-black-letter-ground-truth NZZ-black-letter-ground-truth Public

    8 1

  4. impresso-text-acquisition impresso-text-acquisition Public

    🛠️ Python library to import OCR data in various formats into the canonical JSON format defined by the Impresso project.

    Jupyter Notebook 7 2

  5. impresso-frontend impresso-frontend Public

    🚀 The frontend application of the Impresso WebApp http://impresso-project.ch/app

    Vue 5

  6. impresso.github.io impresso.github.io Public

    HTML 3 4

Repositories

Showing 10 of 46 repositories
  • impresso-linguistic-processing Public

    Code for running spaCy on rebuilt impresso data.

    impresso/impresso-linguistic-processing’s past year of commit activity
    Python 0 AGPL-3.0 0 0 0 Updated Nov 30, 2024
  • impresso-frontend Public

    🚀 The frontend application of the Impresso WebApp http://impresso-project.ch/app

    impresso/impresso-frontend’s past year of commit activity
    Vue 5 AGPL-3.0 0 165 (2 issues need help) 3 Updated Nov 30, 2024
  • impresso-user-admin Public

    Basic Django admin to manage user-related data in Impresso's Master DB.

    impresso/impresso-user-admin’s past year of commit activity
    Python 1 AGPL-3.0 0 7 0 Updated Nov 30, 2024
  • impresso-middle-layer Public

    Middle layer API

    impresso/impresso-middle-layer’s past year of commit activity
    JavaScript 0 AGPL-3.0 1 15 13 Updated Nov 29, 2024
  • impresso-schemas Public

    Repository of JSON schemas used in the Impresso project.

    impresso/impresso-schemas’s past year of commit activity
    Python 3 AGPL-3.0 3 3 0 Updated Nov 29, 2024
  • impresso-py Public

    Impresso Python Library to interact with the Impresso Public API

    impresso/impresso-py’s past year of commit activity
    Python 0 AGPL-3.0 0 7 1 Updated Nov 28, 2024
  • transmedia Public

    Website for the Transmedia History Conference

    impresso/transmedia’s past year of commit activity
    HTML 1 AGPL-3.0 0 0 0 Updated Nov 28, 2024
  • llm-transcript-postcorrection Public

    Work on OCR/ASR/HTR post-correction.

    impresso/llm-transcript-postcorrection’s past year of commit activity
    Jupyter Notebook 1 AGPL-3.0 0 0 0 Updated Nov 27, 2024
  • impresso-datalab Public

    Impresso Datalab static Astro website

    impresso/impresso-datalab’s past year of commit activity
    MDX 0 AGPL-3.0 0 11 0 Updated Nov 26, 2024
  • impresso-essentials Public

    ⚙️ Python package highly reusable modules and functions within impresso.

    impresso/impresso-essentials’s past year of commit activity
    Python 0 GPL-3.0 1 4 1 Updated Nov 25, 2024

Top languages

Loading…

Most used topics

Loading…