There are currently some significant changes to eScriptorium planned. We have decided to delay adapting our Tesseract integration until after the completion of these major changes. Although we have a functional version for both transcription and training with Tesseract in eScriptorium, it still requires further refinement and we wouldn't recommend to use it in the current state for training (it only runs in synchronous way for now). Therefore, we would advise waiting until we have refined our implementation to a point where it is ready for widespread use. We plan to start the rework in summer this year.
We will keep you up to date with the rework process. In the meantime, please feel free to try our current implementation or take a closer look at your tool for PAGE XML data: https://github.com/UB-Mannheim/PagePlus.
eScriptorium is part of the Scripta, RESILIENCE and Biblissima+ projects, and has received funding from Université PSL and from The European Union's Horizon 2020 Research and Innovation Programme under Grant Agreement no. 871127, from the Programme d'investissements d'avenir of the Agence Nationale de Recheche under Grant Reference no. ANR-21-ESRE-0005, as well as from other contributors listed below. Its goal is provide researchers in the humanities with an integrated set of tools to transcribe, annotate, translate and publish historical documents. The eScriptorium app itself is at the 'center'. It is a work in progress but will implement at least automatic transcriptions through kraken, indexation for complex search and filtering, annotation and some simple forms of collaborative working such as sharing and versioning.
- nginx
- uwsgi
- django
- daphne (channel server for websockets)
- celery
- postgres
- elasticsearch (integration not started yet)
- redis (cache, celery broker, other disposable data)
- kraken
- docker (deployment)
Two options,
eScriptorium needs either Linux, Mac OS or (WSL on windows).
See Contributing to eScriptorium.
- École Pratique des Hautes Études (EPHE)
- Biblissima+
- Resilience
- PSL Scripta
- Institut national de recherche en sciences et technologies du numérique (INRIA)
- Archives nationales de France
- L’Institut de recherche et d’histoire des textes
- Open Islamicate Texts Initiative (OpenITI)
- The Andrew W. Mellon Foundation