-
Notifications
You must be signed in to change notification settings - Fork 5.4k
madcat arabic: clean scripts, tuning, rescoring, text localization #2716
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from all commits
Commits
Show all changes
84 commits
Select commit
Hold shift + click to select a range
5fe6cb2
minor change
aarora8 e1f4530
Merge branch 'master' of https://github.com/kaldi-asr/kaldi into ar1
aarora8 c3443d2
updating run_end2end for text localization
aarora8 9c6a923
adding higher language model
aarora8 2c87fe5
fixing bug
aarora8 053fbdb
minor fix
aarora8 837fd4d
adding augmentation
aarora8 9c1d553
updating parameters
aarora8 47b6508
updating parameters
aarora8 18f585e
updating parameters
aarora8 cf22d16
minor cleaning and higher order language model
aarora8 13f2386
Merge branch 'master' of https://github.com/kaldi-asr/kaldi into ar2
aarora8 95aed10
updating results
aarora8 85e3649
minor fix and adding tuning directory
aarora8 bff652c
adding overwrite variable
aarora8 303246e
adding documentation, fixing run.sh, minor fix
aarora8 6b857de
adding text localization changes
aarora8 1bd1448
adding gpu = false for alignments in runend2end
aarora8 895342a
updating text localization routine
aarora8 a72d922
removing unused function
aarora8 b2ef923
minor change
aarora8 b8974aa
adding option for augmentation
aarora8 04b938c
updating text localization routines
aarora8 2a35cf7
fixing merge conflict
aarora8 92a470d
removing unnecessary files
aarora8 8a9b46a
Merge branch 'ar1' of https://github.com/aarora8/kaldi into ar3
aarora8 d7092e4
Merge branch 'ar1' of https://github.com/aarora8/kaldi into ar3
aarora8 9271545
Merge branch 'master' of https://github.com/kaldi-asr/kaldi into ar1
aarora8 86ea346
Merge branch 'master' of https://github.com/kaldi-asr/kaldi into ar3
aarora8 e7b7597
adding lm rescoring, cleaning in chain scripts
aarora8 e647607
minor fix
aarora8 c1c06d0
Merge branch 'ar3' of https://github.com/aarora8/kaldi into ar1
aarora8 a0d2b68
removing prepend words
aarora8 53edde4
minor bug fix
aarora8 e4f973d
Merge branch 'ar3' of https://github.com/aarora8/kaldi into ar1
aarora8 e9ae853
fixing run.sh
aarora8 8d0c793
removing prepare data
aarora8 ee582d5
fixing run.sh
aarora8 a16a11d
removing reverse.py
aarora8 fb0b8a2
removing prepare data
aarora8 7835ed4
adding augmentation during line image creation, removing unnecessary …
aarora8 0234a1a
adding chain recepi
aarora8 a17fbb3
minor fix
aarora8 59c84f2
bug fix
aarora8 a23b478
fixing bugs
aarora8 a3aac1a
fixing bugs
aarora8 8fc860d
bug fix
aarora8 cafd89a
fixing bug in subset
aarora8 87c9241
adding augmentation in text localization
aarora8 0e74e55
fixing bugs
aarora8 60915aa
fixing bugs
aarora8 4099d4a
fixing bugs
aarora8 4f98f69
fixing bugs
aarora8 717501f
fixing bugs
aarora8 56c77c4
fixing bugs
aarora8 b9d2651
fixing bugs
aarora8 74f7a82
fixing bugs
aarora8 7597638
fixing bugs
aarora8 479590a
fixing run.sh
aarora8 87ab218
fixing bug in language modelling
aarora8 d979000
correcting options
aarora8 ed3ab45
adding comments
aarora8 fa34b22
merge conflict
aarora8 22df693
fixing conflict
aarora8 95b1c3a
updating chain parameters
aarora8 8e40c2e
Merge branch 'ar3' of https://github.com/aarora8/kaldi into ar2
aarora8 0b71dae
updating chain parameters
aarora8 a5d04ec
Merge branch 'ar3' of https://github.com/aarora8/kaldi into ar2
aarora8 e380a20
updating parameters
aarora8 639289d
updating parameters
aarora8 78135bb
Merge branch 'ar3' of https://github.com/aarora8/kaldi into ar2
aarora8 04e0236
updating parameters
aarora8 e1efebc
Merge branch 'ar3' of https://github.com/aarora8/kaldi into ar2
aarora8 9c33a35
fixing bug in make features
aarora8 d4516ea
Revert "fixing bug in make features"
aarora8 bac599a
modification from review
aarora8 9f0259f
Merge branch 'ar1' of https://github.com/aarora8/kaldi into ar2
aarora8 c0ac631
Merge branch 'master' of https://github.com/kaldi-asr/kaldi into ar2
aarora8 f0a990e
modification from review, adding new augmentation in make feature
aarora8 09da981
minor fix
aarora8 3d9615e
fixing bugs
aarora8 c33da9f
adding doocumentation
aarora8 ee42879
modification from review
aarora8 405763b
Merge branch 'ar2' of https://github.com/aarora8/kaldi into ar1
aarora8 File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
do you know how much does
--ngram-order=2 --no-prune-ngram-order=1alone help? I'm just curious.There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Previously, I tried running run_e2e_cnn_1a.sh once with --ngram-order=2 --no-prune-ngram-order=1 and once with --num-extra-lm-states=500 but results were same for madcat arabic 7.81 vs 7.82 WER. But it was more helpful in Tamil OCR setup, it had a absolute WER improvement of around 0.5%.