add hlg decode #1521
Conversation
wenet/transformer/asr_model.py
Outdated
@@ -20,6 +20,10 @@
 from torch.nn.utils.rnn import pad_sequence
+import k2
Move it into hlg_onebest & hlg_rescore, so we can run it without k2 & icefall.
To avoid duplicate code, we moved these imports into a try/except block.
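A minimal sketch of the guarded-import pattern being described (the function signature and flag name are illustrative, not the exact code in this PR):

# k2 stays an optional dependency; HLG decoding fails with a clear
# message only when it is actually used.
try:
    import k2
    K2_AVAILABLE = True
except ImportError:
    K2_AVAILABLE = False

def hlg_onebest(speech, speech_lengths):  # illustrative signature
    if not K2_AVAILABLE:
        raise ImportError("k2 is required for HLG decoding; "
                          "see https://github.com/k2-fsa/k2 to install it")
    # ... actual HLG one-best decoding goes here ...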
examples/aishell/s0/README.md
Outdated
@@ -28,6 +28,8 @@
 | ctc prefix beam search | 5.17 | 5.81 |
 | attention rescoring | 4.63 | 5.05 |
 | LM + attention rescoring | 4.40 | 4.75 |
+| HLG | 4.81 | 5.27 |
How about HLG (k2 LM), so it is easier for users to understand?
Done
Please see inline.
What is the motivation to add HLG from k2?
k2 can do batch decoding with CUDA, and it has a Python interface.
Thanks! Do you have any benchmarks to share?
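For context, a rough sketch of batched one-best HLG decoding with k2's Python API (the HLG path and beam values are illustrative; ctc_log_probs and supervision_segments would come from the wenet encoder):

import torch
import k2

# Load a precompiled HLG graph onto the GPU (file name is illustrative).
HLG = k2.Fsa.from_dict(torch.load("data/local/hlg/HLG.pt")).to("cuda")

# ctc_log_probs: (batch, time, vocab) CTC posteriors from the encoder;
# supervision_segments: one (utterance_index, start_frame, num_frames)
# row per utterance in the batch.
dense_fsa_vec = k2.DenseFsaVec(ctc_log_probs, supervision_segments)

# Batched, beam-pruned intersection of the posteriors with HLG.
lattice = k2.intersect_dense_pruned(
    HLG, dense_fsa_vec,
    search_beam=20.0, output_beam=8.0,
    min_active_states=30, max_active_states=10000)

# One best path per utterance in the batch.
best_paths = k2.shortest_path(lattice, use_double_scores=True)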
examples/aishell/s0/run.sh
Outdated
# Optionally, you can decode with k2 hlg
if [ ${stage} -le 8 ] && [ ${stop_stage} -ge 8 ]; then
  if [ ! -f data/local/lm/lm.arpa ]; then
    echo "Please run prepare dict and train lm in Stage 7"
Do we need to add exit 1 to stop processing?
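For reference, a minimal sketch of the suggested guard (paths and message copied from the diff above):

if [ ! -f data/local/lm/lm.arpa ]; then
  echo "Please run prepare dict and train lm in Stage 7"
  exit 1
fi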
Done
examples/aishell/s0/run.sh
Outdated
  fi

  # 8.1 Build decoding HLG
  tools/k2/make_hlg.sh data/local/dict/ data/local/lm/ data/local/hlg
Shall we skip this step if the file is already generated?
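For example, the step could be wrapped in a guard like this (using HLG.pt as the output file name is an assumption about what make_hlg.sh generates):

# 8.1 Build decoding HLG, skipping it if already generated
# (HLG.pt as the output file name is an assumption here)
if [ ! -f data/local/hlg/HLG.pt ]; then
  tools/k2/make_hlg.sh data/local/dict/ data/local/lm/ data/local/hlg
fi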
Done
The WER performance is already compared in the README. For a speed comparison, the wenet runtime defaults to batch size 1, which is not comparable with HLG decoding at batch size 16. I will do the speed benchmark later in a fair scenario.
It is more convenient to use FSA in Python than C++ in OpenFst, especially for beginners. And Dan thinks it is okay to support k2 in wenet.
Besides, we can use k2 with a GPU.
Thank you for sharing, but we would like to know: how can we access HLG decoding using C++?
It's impossible for us to load the dynamic libraries provided by k2 and wenet at the same time.
Why is it impossible? Have you used the same version of PyTorch and CUDA to compile k2 and wenet?
We add HLG decoding using k2. Now we can use k2 to compile an HLG graph and decode in Python with CUDA.
We provide two decoding algorithms. One is onebest, which is the same as ctc_prefix_beam_search plus an LM score; the other is attention rescore.
Notice that in attention rescore, we have 3 different scale parameters: lm_scale, decoder_scale and r_decoder_scale.
Special thanks to the K2 group's great work!
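To illustrate how the three scales combine, here is a sketch of the scoring idea (variable names and default values are illustrative, not the exact formula or tuned values in this PR):

def rescore_score(am_score, lm_score, decoder_score, r_decoder_score,
                  lm_scale=0.5, decoder_scale=0.5, r_decoder_scale=0.3):
    """Combined score for one n-best hypothesis (illustrative defaults).

    am_score:        acoustic (CTC) score of the lattice path
    lm_score:        graph/LM score contributed by HLG
    decoder_score:   left-to-right attention-decoder score
    r_decoder_score: right-to-left (reversed) decoder score
    """
    return (am_score
            + lm_scale * lm_score
            + decoder_scale * decoder_score
            + r_decoder_scale * r_decoder_score)

The hypothesis with the highest combined score is selected as the final result.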