Improving Document Binarization via Adversarial Noise-Texture Augmentation [paper] [ICIP 2019]
This repository contains the full source codes and instructions to use the codes on the datasets described in the paper. The paper re-visits the binarization problem by introducingan adversarial learning approach.
The most significant contribution of our framework is that it does not require any paired data unlike other Deep Learning-based methods [ronneberger et al., Vo et al.,Konwer et al.].Such a novel approach has never been implemented earlier thus making it the very first of its kind in Document Image Analysis community.
In this paper, we propose a two-stage network that first learns to augment the document images by using neural style transfer technique. For this purpose, we construct a Texture Augmentation Network that transfers the texture element of a degraded reference document image to a clean binary image.
In this way, the network creates multiple versions of the same textual content with various noisy textures, enlarging the available document binarization datasets. At last, the newly generated images are passed through a Binarization network to get back the clean version.
- python 2.7
- Tensorflow
- OpenCV
- matplotlib
- A total of 9 datasets are used in this work: DIBCO 2009, DIBCO 2011, DIBCO 2013, H-DIBCO 2010, HDIBCO 2012, H-DIBCO 2014, Bickley diary, PHIDB, and S-MS datasets.
- Out of these datasets, DIBCO 2013 dataset is selected for testing purposes. For the testing, the remaining datasets are used as a training set.
- We convert the images from these datasets to patches of size 256 X 256.
- Download the VGG weights from here and put it in the repository folder.
- To train the model run the 'train.py' file
If you find this code useful in your research, please consider citing:
@article{bhunia2018improving,
title={Improving Document Binarization via Adversarial Noise-Texture Augmentation},
author={Bhunia, Ankan Kumar and Bhunia, Ayan Kumar and Sain, Aneeshan and Roy, Partha Pratim},
journal={arXiv preprint arXiv:1810.11120},
year={2018}
}