Deep learning technologies have greatly improved the performance of modern handwritten word recognition.
However, a large amount and variety of data is needed to train such models.
We propose an automated writer adaptation approach with which a trained handwritten word recognizer can adjust to a new writing style.
To achieve this, samples in a writer's style are synthesized by a generative adversarial network and used to fine-tune the recognizer.
More information about the models used can be found in the underlying projects, research-seq2seq-HTR and research-GANwriting.
The necessary dependencies can be installed with `pip install -r requirements.txt`. This project runs with Python 3.8.
For testing the models and calculating the CER (character error rate), `gawk` has to be installed: `apt install gawk`.
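For reference, a minimal setup on a Debian/Ubuntu-style system could look like the following (assuming a Python 3.8 environment is already active; `sudo` may or may not be needed depending on your setup):

```bash
# Install the Python dependencies of the pipeline
pip install -r requirements.txt

# gawk is only required for testing the models / computing the CER
sudo apt install gawk
```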
Pretrained models for both the HWR and the GAN are part of this repository and can be downloaded via git-lfs.
Simply install git-lfs and run `git lfs pull`. (Don't forget to run `git lfs install` if you're using git-lfs for the first time.)
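For example, fetching the pretrained models on a fresh clone could look like this:

```bash
# One-time git-lfs setup (only needed the first time you use git-lfs)
git lfs install

# Download the pretrained HWR and GAN models tracked via git-lfs
git lfs pull
```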
Other models can also be used, but they need to be referenced in `config.yaml`.
The `synthesized_training.py` script starts the pipeline. By default, it expects input images in the `input_images` folder.
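As a sketch (assuming the default invocation takes no further arguments; the image file extension shown is only illustrative), adapting the recognizer to your own samples could look like this:

```bash
# Place the handwriting samples of the new writer in the default input folder
mkdir -p input_images
cp /path/to/your/samples/*.png input_images/

# Start the synthesis + fine-tuning pipeline
python3 synthesized_training.py
```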
Optionally, the trained model can also be tested.
To do this, put test images in the default `test_images` folder, together with a `labels.txt` file for these images that looks like this:
```
test_img_1.png hello
test_img_2.png world
```
Afterwards, the pipeline can be started with `python3 synthesized_training.py -t`.
To train on writers of the IAM dataset, the IAM words directory needs to be referenced in the `config.yaml` file.
It expects the IAM dataset in a "flat" form, i.e., not in the standard directory structure of the original dataset archive, but with all images directly in the referenced directory.
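If you are starting from the original IAM `words` archive (which nests the word images in per-form subdirectories), a hedged example of producing such a flat directory with standard shell tools would be:

```bash
# Copy all IAM word images from the nested 'words/' archive layout
# into a single flat directory (the target path is just an example)
mkdir -p /data/iam_words_flat
find words/ -name '*.png' -exec cp {} /data/iam_words_flat/ \;
```

This flat directory is what should be referenced in `config.yaml`.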
A given writer can then be trained with `python3 synthesized_training.py --iam <Writer_ID>`.
The writer ID is a three-digit number ranging from 000 to around 650.
Testing can again be enabled with the `-t` flag; in this case, testing is done on all images of this specific writer in the IAM dataset.
The number of images to be synthesized can be adjusted with the `--n_generated_images` flag.
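Putting it together, a run for one IAM writer with a custom number of synthesized images and a final evaluation could look like this (the writer ID and image count are just example values):

```bash
python3 synthesized_training.py --iam 000 --n_generated_images 200 -t
```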
Repository folders:
- `HWR/` - Code of the Handwritten Word Recognizer (research-seq2seq-HTR)
- `GAN/` - Code of the Generative Adversarial Network (research-GANwriting)
- `pretrained_models/` - Pretrained models of both the GAN and the HWR
- `data/` - Contains the mapping from IAM writers to their images
Output folders:
- `model_weights/` - Saves the trained models after running the pipeline
- `evaluations/` - Saves the predictions of the models
- `synthesized_images/` - Saves images generated by the GAN
- `label_files/` - Saves the labels for generated images