@aarora8 aarora8 commented Oct 30, 2017

This branch adds handwritten word recognition on text-line images. It uses a word-level language model.

Add scripts for data preparation, which generate the text, wav.scp and utt2spk files (local/prepare_data.sh, local/process_data.py)
Add a script for feature extraction (local/make_feature_vect.py)
Add scripts for the lexicon, language model and grammar (egs/iam/s5/local/prepare_lm.sh, egs/iam/s5/local/prepare_lexicon.py, egs/iam/s5/local/prepare_dict.sh)
Add scripts for GMM-HMM training and for the chain models (egs/iam/s5/local/chain/run_cnn_1a.sh, egs/iam/s5/local/chain/align_nnet3_lats.sh, egs/iam/s5/run.sh, egs/iam/s5/local/chain/run_cnn_chainali_1a.sh)
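
For readers unfamiliar with the three data files named in the list above, here is a minimal sketch of how such files are typically laid out in a Kaldi data directory. It is not the code in local/process_data.py; the function name `write_data_dir`, the record layout, and the use of the writer ID as the "speaker" are assumptions made for illustration.

```python
import os


def write_data_dir(records, out_dir):
    """Write Kaldi-style text, utt2spk and wav.scp files.

    records: iterable of (utt_id, writer_id, image_path, transcript) tuples.
    The tuple layout is hypothetical, chosen only for this sketch.
    """
    os.makedirs(out_dir, exist_ok=True)
    # Kaldi expects these files to be sorted by utterance ID.
    records = sorted(records, key=lambda r: r[0])
    text_path = os.path.join(out_dir, "text")
    utt2spk_path = os.path.join(out_dir, "utt2spk")
    scp_path = os.path.join(out_dir, "wav.scp")
    with open(text_path, "w", encoding="utf-8") as text_f, \
            open(utt2spk_path, "w", encoding="utf-8") as utt2spk_f, \
            open(scp_path, "w", encoding="utf-8") as scp_f:
        for utt_id, writer_id, image_path, transcript in records:
            # text: "<utt-id> <transcript>"
            text_f.write(f"{utt_id} {transcript}\n")
            # utt2spk: "<utt-id> <speaker-id>" (assumed here: the writer ID)
            utt2spk_f.write(f"{utt_id} {writer_id}\n")
            # wav.scp: "<utt-id> <path>" (assumed here: the line image path)
            scp_f.write(f"{utt_id} {image_path}\n")
```

Each file maps an utterance ID (here, one text-line image) to a transcript, a writer, or a file path, one entry per line.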

@aarora8 aarora8 closed this Oct 31, 2017
aarora8 pushed a commit that referenced this pull request Jan 17, 2018
* OCR: Add IAM corpus with unk decoding support (#3)

* Add a new English OCR database 'UW3'

* Some minor fixes re IAM corpus

* Fix an issue in IAM chain recipes + add a new recipe (#6)

* Some fixes based on the pull request review

* Various fixes + cleaning on IAM

* Fix LM estimation and add extended dictionary + other minor fixes

* Add README for IAM

* Add output filter for scoring

* Fix a bug RE switch to python3

* Add updated results + minor fixes

* Remove unk decoding -- gives almost no gain

* Add UW3 OCR database

* Fix cmd.sh in IAM + fix usages of train/decode_cmd in chain recipes

* Various minor fixes on UW3

* Rename iam/s5 to iam/v1

* Add README file for UW3

* Various cosmetic fixes on UW3 scripts

* Minor fixes in IAM
aarora8 pushed a commit that referenced this pull request Feb 21, 2018
aarora8 pushed a commit that referenced this pull request Oct 11, 2019
aarora8 pushed a commit that referenced this pull request Dec 4, 2019
Track 2 pipeline with SAD and Diarization