-
Notifications
You must be signed in to change notification settings - Fork 5.4k
changes for GALE mandarin setup #1207
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
I don't like the weird RESULTS filename. |
|
I guess there's no hurry to merge, but do you see a reason why we would not On Tue, Nov 22, 2016 at 5:09 PM, jtrmal [email protected] wrote:
|
|
I can, of course, change it -- no issue in that. Maybe the original idea was that the RESULTS file won't be overwritten by people running the recipes and generating their own results and if the results should be used as a reference, they should be manually renamed to RESULTS -- in that case I'm at fault, I wasn't sure of the reasons. |
|
Thank you Yenda for the updated numbers. I was curious why there wasn't Actually, just think out loud, there is no such thing as 'eval2000' and Xingyu On 2016/11/23 6:09, jtrmal wrote:
|
|
Xingyu, I'd say go ahead and do what you think is best, taking into account On Tue, Nov 22, 2016 at 9:33 PM, Xingyu Na [email protected] wrote:
|
|
OK. On 2016/11/23 10:39, Daniel Povey wrote:
|
|
@jtrmal I don't see where you use all data to train. It's still running on LDC2013S08 and LDC2013T20. |
|
Let me check, maybe I didn't commit the changes to run.sh On Nov 23, 2016 8:11 AM, "Xingyu Na" [email protected] wrote:
|
|
I commit the run.sh and path.sh. Also renamed the RESULTS file. Sorry about On Wed, Nov 23, 2016 at 8:26 AM, Jan Trmal [email protected] wrote:
|
| exit 1 | ||
| fi | ||
| fi | ||
|
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Why is this removed? g2p.py is used later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There should be a helpful error message if g2p is not found.
|
we have an installation script in tools/extras/
It should be checking for availability of g2p.py and printing helpful error
message in case the g2p is not found, though -- my bad
y.
…On Thu, Nov 24, 2016 at 4:51 AM, Xingyu Na ***@***.***> wrote:
***@***.**** commented on this pull request.
------------------------------
In egs/gale_mandarin/s5/local/gale_prep_dict.sh
<#1207 (review)>:
> -export PYTHONPATH=$PYTHONPATH:`pwd`/tools/g2p/lib/python${pyver}/site-packages
-if [ ! -f tools/g2p/lib/python${pyver}/site-packages/g2p.py ]; then
- echo "--- Downloading Sequitur G2P ..."
- echo "NOTE: it assumes that you have Python, NumPy and SWIG installed on your system!"
- wget -P tools http://www-i6.informatik.rwth-aachen.de/web/Software/g2p-r1668.tar.gz
- tar xf tools/g2p-r1668.tar.gz -C tools
- cd tools/g2p
- echo '#include <cstdio>' >> Utility.hh # won't compile on my system w/o this "patch"
- python setup.py build
- python setup.py install --prefix=.
- cd ../..
- if [ ! -f tools/g2p/lib/python${pyver}/site-packages/g2p.py ]; then
- echo "Sequitur G2P is not found - installation failed?"
- exit 1
- fi
-fi
Why is this removed? g2p.py is used later.
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#1207 (review)>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AKisX0LFXjrEKOT5tBk7xQ_mgD6ljb9fks5rBV4fgaJpZM4K5nPZ>
.
|
naxingyu
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Besides the comments, the RESULT file should be renamed. And did you plan to commit the changes you made on the scoring script?
| exit 1 | ||
| fi | ||
| fi | ||
|
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There should be a helpful error message if g2p is not found.
|
|
||
| wait | ||
| local/nnet/run_dnn.sh | ||
|
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
in local/nnet/run_dnn.sh, there is a invalid option to steps/nnet/train.sh (--use-gpu-id) and a missing done near the end of the script.
| @@ -0,0 +1,61 @@ | |||
| #!/bin/bash | |||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This new script is not called in run.sh. We may think of deprecate the original split_wer.sh and rename this one as split_wer.sh. That one was errornous anyway.
|
Xingyu, I don't think Yenda was planning to do much more work on this. I
think what's best is if you merge changes from his PR as you like, fix it
in the way that you think best, and then make your own pull request, whih
we can merge.
…On Wed, Nov 30, 2016 at 3:26 AM, Xingyu Na ***@***.***> wrote:
***@***.**** commented on this pull request.
Besides the comments, the RESULT file should be renamed. And did you plan
to commit the changes you made on the scoring script?
------------------------------
In egs/gale_mandarin/s5/local/gale_prep_dict.sh
<#1207 (review)>:
> -export PYTHONPATH=$PYTHONPATH:`pwd`/tools/g2p/lib/python${pyver}/site-packages
-if [ ! -f tools/g2p/lib/python${pyver}/site-packages/g2p.py ]; then
- echo "--- Downloading Sequitur G2P ..."
- echo "NOTE: it assumes that you have Python, NumPy and SWIG installed on your system!"
- wget -P tools http://www-i6.informatik.rwth-aachen.de/web/Software/g2p-r1668.tar.gz
- tar xf tools/g2p-r1668.tar.gz -C tools
- cd tools/g2p
- echo '#include <cstdio>' >> Utility.hh # won't compile on my system w/o this "patch"
- python setup.py build
- python setup.py install --prefix=.
- cd ../..
- if [ ! -f tools/g2p/lib/python${pyver}/site-packages/g2p.py ]; then
- echo "Sequitur G2P is not found - installation failed?"
- exit 1
- fi
-fi
There should be a helpful error message if g2p is not found.
------------------------------
In egs/gale_mandarin/s5/run.sh
<#1207 (review)>:
> data/dev exp/sgmm_5a/decode exp/sgmm_5a_mmi_onlyRescoreb0.1/decode$n
done
+wait
local/nnet/run_dnn.sh
in local/nnet/run_dnn.sh, there is a invalid option to steps/nnet/train.sh
(--use-gpu-id) and a missing done near the end of the script.
------------------------------
In egs/gale_mandarin/s5/local/split_wer_per_corpus.sh
<#1207 (review)>:
> @@ -0,0 +1,61 @@
+#!/bin/bash
This new script is not called in run.sh. We may think of deprecate the
original split_wer.sh and rename this one as split_wer.sh. That one was
errornous anyway.
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#1207 (review)>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/ADJVu_Actf0WZSLERSBp8adcybCGZatnks5rDTMdgaJpZM4K5nPZ>
.
|
|
OK.
在 2016/12/1 4:31, Daniel Povey 写道:
… Xingyu, I don't think Yenda was planning to do much more work on this. I
think what's best is if you merge changes from his PR as you like, fix it
in the way that you think best, and then make your own pull request, whih
we can merge.
On Wed, Nov 30, 2016 at 3:26 AM, Xingyu Na ***@***.***>
wrote:
> ***@***.**** commented on this pull request.
>
> Besides the comments, the RESULT file should be renamed. And did you
plan
> to commit the changes you made on the scoring script?
> ------------------------------
>
> In egs/gale_mandarin/s5/local/gale_prep_dict.sh
>
<#1207 (review)>:
>
> > -export
PYTHONPATH=$PYTHONPATH:`pwd`/tools/g2p/lib/python${pyver}/site-packages
> -if [ ! -f tools/g2p/lib/python${pyver}/site-packages/g2p.py ]; then
> - echo "--- Downloading Sequitur G2P ..."
> - echo "NOTE: it assumes that you have Python, NumPy and SWIG
installed on your system!"
> - wget -P tools
http://www-i6.informatik.rwth-aachen.de/web/Software/g2p-r1668.tar.gz
> - tar xf tools/g2p-r1668.tar.gz -C tools
> - cd tools/g2p
> - echo '#include <cstdio>' >> Utility.hh # won't compile on my
system w/o this "patch"
> - python setup.py build
> - python setup.py install --prefix=.
> - cd ../..
> - if [ ! -f tools/g2p/lib/python${pyver}/site-packages/g2p.py ]; then
> - echo "Sequitur G2P is not found - installation failed?"
> - exit 1
> - fi
> -fi
>
>
> There should be a helpful error message if g2p is not found.
> ------------------------------
>
> In egs/gale_mandarin/s5/run.sh
>
<#1207 (review)>:
>
> > data/dev exp/sgmm_5a/decode exp/sgmm_5a_mmi_onlyRescoreb0.1/decode$n
> done
>
> +wait
> local/nnet/run_dnn.sh
>
>
> in local/nnet/run_dnn.sh, there is a invalid option to
steps/nnet/train.sh
> (--use-gpu-id) and a missing done near the end of the script.
> ------------------------------
>
> In egs/gale_mandarin/s5/local/split_wer_per_corpus.sh
>
<#1207 (review)>:
>
> > @@ -0,0 +1,61 @@
> +#!/bin/bash
>
> This new script is not called in run.sh. We may think of deprecate the
> original split_wer.sh and rename this one as split_wer.sh. That one was
> errornous anyway.
>
> —
> You are receiving this because you were mentioned.
> Reply to this email directly, view it on GitHub
>
<#1207 (review)>,
> or mute the thread
>
<https://github.com/notifications/unsubscribe-auth/ADJVu_Actf0WZSLERSBp8adcybCGZatnks5rDTMdgaJpZM4K5nPZ>
> .
>
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#1207 (comment)>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/ADKpxGKmUWKzXMXbs5iDaaik6W9XJg91ks5rDd0QgaJpZM4K5nPZ>.
|
|
This is being handled by #1253, so closing... |
I did some changes in order to allow me to train on all data.
depends on @naxingyu and @danpovey if they decide merge or if a better strategy would be only cherry-pick some files or changes.
y.