TextProposals

Implementation of the method proposed in the papers:

"TextProposals: a Text-specific Selective Search Algorithm for Word Spotting in the Wild" (Gomez and Karatzas), Pattern Recognition, vol. 70, pp.60-74, 2017.
"Object Proposals for Text Extraction in the Wild" (Gomez & Karatzas), International Conference on Document Analysis and Recognition, ICDAR2015.

This code reproduces the results published on the papers for the SVT, ICDAR2013, ICDAR2015 datasets.

If you make use of this code, we appreciate it if you cite our papers:

@article{gomez2016,
  title     = {TextProposals: a Text-specific Selective Search Algorithm for Word Spotting in the Wild},
  author    = {Lluis Gomez and Dimosthenis Karatzas},
  journal = "Pattern Recognition ",
  volume = "70",
  pages = "60 - 74",
  year = "2017"
}

@inproceedings{GomezICDAR15object,
  title     = {Object Proposals for Text Extraction in the Wild},
  author    = {Lluis Gomez and Dimosthenis Karatzas},
  booktitle = {ICDAR},
  year      = {2015}
}

For any questions please write us: ({lgomez,dimos}@cvc.uab.es). Thanks!

Includes the following third party code:

fast_clustering.cpp Copyright (c) 2011 Daniel Müllner, under the BSD license. http://math.stanford.edu/~muellner/fastcluster.html
binomial coefficient approximations are due to Rafael Grompone von Gioi. http://www.ipol.im/pub/art/2012/gjmr-lsd/

CNN models

The end-to-end evaluation require the DictNet_VGG model to be placed in the project root directory. DictNet_VGG Caffe model and prototxt are available here http://nicolaou.homouniversalis.org/assets/vgg_text/

Compilation

Requires: OpenCV 3.1 (tested with 02edfc8), Caffe (tested with d21772c), tinyXML

NOTE: Due to some changes on the OpenCV API, if you want to reproduce the results on our paper you will have to checkout to a specific commit of the opencv code. E.g.:

git clone https://github.com/opencv/opencv
cd opencv
git checkout 02edfc8df2f5dbb2ccb3a3e9a318a837f253190f

Then you can compile the TextProposals code as follows:

git clone https://github.com/lluisgomez/TextProposals
cd TextProposals
cmake .
make

(NOTE: you may need to change the include and lib paths to your Caffe and cuda installations in CMakeLists.txt file)

Run

./img2hierarchy <img_filename>

writes to stdout a list of proposals, one per line, with the format: x,y,w,h,c. where x,y,w,h define a bounding box, and c is a confidence value used to rank the proposals.

./img2hierarchy_cnn <img_filename>

same as before but for end-to-end recognition using the DictNet_VGG CNN model.

End-to-end Evaluation

The following commands reproduce end-to-end results in our paper:

./eval_IC03 data/ICDAR2003/SceneTrialTest/words.xml <LEX_SIZE>

./eval_SVT data/SVT/test.xml <LEX_SIZE>

./eval_IC15 <LEX_SIZE>

The value of LEX_SIZE parameter indicates the size of the lexicon to be used: 0 (for small lexicons), 1 (for Full lexicon), or 2 (for no lexicon, i.e. the 90k word vocabulary of the DictNet model).

Ground truth data for each dataset must be downloaded and placed in their respective folders in ./data/ directory.

In the case of ICDAR2015, since test ground truth is not available, the program save the results in res/ directory. These results files can be uploaded to the ICDAR Robust Reading Competition site for evaluation.

Object Proposal Evaluation

The following command lines generate a txt file with proposals for each image in the SVT and ICDAR2013 datasets respectively.

once the files are generated you may want to run the matlab code in the evaluation/ folder to get the IoU scores and plots.

Notice that the MATLAB evaluation script performs deduplicatioin of the bounding boxes proposals. Thus, if you use another evauation framework you must deduplicate proposals same way.

Name		Name	Last commit message	Last commit date
Latest commit History 36 Commits
data		data
evaluation		evaluation
lex		lex
.gitignore		.gitignore
CMakeLists.txt		CMakeLists.txt
README.md		README.md
agglomerative_clustering.cpp		agglomerative_clustering.cpp
agglomerative_clustering.h		agglomerative_clustering.h
eval_IC03.cpp		eval_IC03.cpp
eval_IC15.cpp		eval_IC15.cpp
eval_SVT.cpp		eval_SVT.cpp
fast_clustering.cpp		fast_clustering.cpp
image_contour.h		image_contour.h
lex.txt		lex.txt
main.cpp		main.cpp
main_cnn.cpp		main_cnn.cpp
min_bounding_box.cpp		min_bounding_box.cpp
min_bounding_box.h		min_bounding_box.h
nfa.cpp		nfa.cpp
region.cpp		region.cpp
region.h		region.h
stopping_rule.cpp		stopping_rule.cpp
stopping_rule.h		stopping_rule.h
trained_boost_groups.xml		trained_boost_groups.xml
utils.h		utils.h

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TextProposals

CNN models

Compilation

Run

End-to-end Evaluation

Object Proposal Evaluation

About

Releases

Packages

Contributors 2

Languages

lluisgomez/TextProposals

Folders and files

Latest commit

History

Repository files navigation

TextProposals

CNN models

Compilation

Run

End-to-end Evaluation

Object Proposal Evaluation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages