@impresso

Media Monitoring of the Past

Media Monitoring of the Past - Beyond Borders: Connecting Historical Newspapers and Radio.


About

Hi there 👋!

Impresso - Media Monitoring of the Past is an interdisciplinary research project that uses machine learning to pursue a paradigm shift in the processing, semantic enrichment, representation, exploration, and study of historical media across modalities and across temporal, linguistic, and national borders. The project has received two rounds of funding, for 2017-2020 and 2023-2027, so this organization contains code from both periods.

We design and develop the Impresso Web App and the upcoming Impresso Datalab, while conducting research at the intersection of Natural Language Processing, Design, and History. More details are available on the project website.

Contents

This GitHub organization hosts numerous repositories dedicated to:

  • the code behind the Web App and Datalab. While a few repositories are public, many are still private. We aim to document and release code properly as it matures and becomes ready;
  • code supporting research efforts;
  • code from student projects.

More information and highlights will be shared as we continue to make progress! In addition to the public repositories listed below, you can also check out our models on the Impresso Hugging Face organisation.

Impresso 2 release history

(to come)

Popular repositories

  1. named-entity-tutorial-dh2019 Public

    Tutorial on NE processing for Digital Humanities - DH Utrecht 2019

    Jupyter Notebook 25 4

  2. CLEF-HIPE-2020 Public

    Identifying Historical People, Places and other Entities: Shared Task on Named Entity Recognition and Linking on Historical Newspapers at CLEF 2020.

    SCSS 22 5

  3. NZZ-black-letter-ground-truth Public

    9 1

  4. impresso-text-acquisition Public

    🛠️ Python library to import OCR data in various formats into the canonical JSON format defined by the Impresso project.

    Jupyter Notebook 7 2

  5. impresso-frontend Public

    🚀 The frontend application of the Impresso WebApp http://impresso-project.ch/app

    Vue 5

  6. impresso.github.io Public

    HTML 3 4

Repositories

Showing 10 of 48 repositories
  • impresso-middle-layer Public

    Middle layer API

    JavaScript 0 AGPL-3.0 1 14 1 Updated Jan 22, 2025
  • impresso-frontend Public

    🚀 The frontend application of the Impresso WebApp http://impresso-project.ch/app

    Vue 5 AGPL-3.0 0 178 (2 issues need help) 7 Updated Jan 21, 2025
  • impresso-linguistic-processing Public

    Code for running spaCy on rebuilt impresso data.

    Python 0 AGPL-3.0 0 1 0 Updated Jan 21, 2025
  • impresso-schemas Public

    Repository of JSON schemas used in the Impresso project.

    Python 3 AGPL-3.0 3 2 0 Updated Jan 21, 2025
  • impresso-text-acquisition Public

    🛠️ Python library to import OCR data in various formats into the canonical JSON format defined by the Impresso project.

    Jupyter Notebook 7 AGPL-3.0 2 30 (1 issue needs help) 0 Updated Jan 21, 2025
  • transmedia Public

    Website for the Transmedia History Conference

    HTML 1 AGPL-3.0 0 0 0 Updated Jan 21, 2025
  • paraphrasus Public
    Jupyter Notebook 2 AGPL-3.0 1 0 0 Updated Jan 21, 2025
  • impresso-make-cookbook Public

    Repository for a make-based cookbook of offline NLP processing steps

    0 AGPL-3.0 0 1 0 Updated Jan 20, 2025
  • impresso-essentials Public

    ⚙️ Python package of highly reusable modules and functions used within Impresso.

    Python 0 GPL-3.0 1 9 1 Updated Jan 20, 2025
  • impresso-jscommons Public

    Reusable components for impresso-frontend and impresso-middle-layer

    JavaScript 0 AGPL-3.0 0 0 1 Updated Jan 16, 2025