    Repositories list

    • Core libraries by the PRImA Research Lab
      HTML
      Apache License 2.0
      151663 Updated Jul 30, 2024
    • NAME-XML (Public)
      XML schemas for named entities and relations
      Apache License 2.0
      0300 Updated Dec 2, 2023
    • Java-based viewer for PAGE XML files (layout + text content). Also supports ALTO XML, FineReader XML, and HOCR.
      HTML
      Apache License 2.0
      93582 Updated May 25, 2023
    • Text-related functionality (text comparison / evaluation, filtering, export, etc.)
      Java
      Apache License 2.0
      0010 Updated May 31, 2022
    • PAGE-XML (Public)
      PAGE XML format collection for document image page content and more
      XSLT
      Apache License 2.0
      866101 Updated Jul 7, 2021
    • Web-based page layout editor created for EMOP (Early Modern OCR Project).
      Java
      Apache License 2.0
      51110 Updated May 21, 2021
    • Web-based viewer and editor for PAGE XML
      HTML
      Apache License 2.0
      1800 Updated May 21, 2021
    • Command-line tool to convert page layout files to the latest PAGE XML format. It supports all previous versions of the PAGE format as well as ALTO XML, FineReader XML, and HOCR.
      HTML
      Apache License 2.0
      62370 Updated Jan 30, 2021
    • Java command-line tool to convert PAGE XML files with layout and text content to PDF
      HTML
      Apache License 2.0
      21030 Updated Apr 27, 2020
    • PAGE Metadata Scanner is a command-line tool that scans a single PAGE XML file (document layout and text content) and outputs its properties in CSV format.
      HTML
      Apache License 2.0
      2300 Updated Nov 12, 2019
    • root (Public)
      Some general stuff concerning PRImA tools
      0000 Updated Sep 20, 2019
    • Tool to call Google Cloud Vision OCR and save the result as PAGE XML
      Java
      Apache License 2.0
      0200 Updated Sep 15, 2019
    • Partial source code of PRImA Layout Evaluation Tool
      C++
      Apache License 2.0
      0200 Updated Sep 6, 2019
    • Semantic labelling - Ontology, search and matching algorithms, workflow tools
      Java
      Apache License 2.0
      5911 Updated Oct 18, 2018
    • Image processing functions used by PRImA tools
      C++
      Apache License 2.0
      2400 Updated Oct 3, 2017
    • Library with user interface elements and client-server communication classes based on Google Web Toolkit (GWT) that can be used for crowdsourcing applications.
      HTML
      Apache License 2.0
      31410 Updated Oct 3, 2017