Focus: language model (LM) fusion for speech recognition.
When a language model is used, a wide beam search often yields incomplete transcripts; with a narrow beam the problem is less visible because of implicit hypothesis pruning.
Check whether the same issue appears in CTC + LM fusion; see the scoring sketch below.
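A minimal sketch of where the effect could show up in a CTC prefix beam search with shallow fusion; the class, function, and weight names below are illustrative assumptions, not this repo's actual API.

```python
# Illustrative shallow-fusion scoring for one hypothesis in a CTC prefix beam search.
# Names and default weights are hypothetical, not this project's actual API.
from dataclasses import dataclass, field
from typing import List


@dataclass
class Hypothesis:
    tokens: List[int] = field(default_factory=list)  # decoded token ids so far
    ctc_score: float = 0.0   # log P_ctc(tokens | audio), maintained by the beam search
    lm_score: float = 0.0    # log P_lm(tokens), accumulated as tokens are appended


def fused_score(hyp: Hypothesis, lm_weight: float = 0.3, length_bonus: float = 0.5) -> float:
    # The LM log-probability is always <= 0, so it strictly penalises longer prefixes;
    # with a wide beam and no length bonus, short (incomplete) transcripts can win.
    return hyp.ctc_score + lm_weight * hyp.lm_score + length_bonus * len(hyp.tokens)


# Pruning step of the beam search: keep the top-k hypotheses under the fused score.
def prune(beam: List[Hypothesis], beam_size: int) -> List[Hypothesis]:
    return sorted(beam, key=fused_score, reverse=True)[:beam_size]
```

A length bonus (or a coverage term) is the usual mitigation discussed in "Towards better decoding and language model integration in sequence to sequence models" from the reference list below.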
- adaptive softmax for a large vocabulary (the official PyTorch implementation does not work with TorchScript); see the sketch after this list
- ONNX and TorchScript support
- GRU
- RNN with tied input/output embeddings; see the GRU sketch after this list
- GRU fusion in the WeNet runtime CTC prefix beam search
- Transformer-XL with cache
- Transformer-XL with cache for fusion
- MWER training with LM fusion; see the loss sketch after this list
- etc
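For the adaptive-softmax item, a hedged sketch of how PyTorch's built-in module is used; the cutoffs, sizes, and tensor shapes are made up for the example. A TorchScript-compatible reimplementation would need to expose the same `log_prob`-style interface that fusion consumes.

```python
# Illustrative use of PyTorch's built-in adaptive softmax for a large vocabulary.
# Cutoffs and sizes are assumptions; the vocabulary must be ordered by decreasing
# frequency for the cluster cutoffs to make sense.
import torch
import torch.nn as nn

vocab_size, dim = 200_000, 512
adaptive = nn.AdaptiveLogSoftmaxWithLoss(
    in_features=dim,
    n_classes=vocab_size,
    cutoffs=[2_000, 20_000, 100_000],
)

hidden = torch.randn(32, dim)                  # e.g. GRU outputs flattened to (N, dim)
targets = torch.randint(0, vocab_size, (32,))
result = adaptive(hidden, targets)             # result.output: per-target log-probs
loss = result.loss                             # mean negative log-likelihood
log_probs = adaptive.log_prob(hidden)          # full (N, vocab_size) log-probs for fusion
```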
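For the GRU and tied-embedding items, a minimal sketch of a GRU LM with weight tying; it assumes embedding and hidden dimensions match so the output projection can reuse the embedding matrix. Class and parameter names are illustrative.

```python
# Minimal GRU language model with tied input/output embeddings; names and sizes are illustrative.
from typing import Optional, Tuple

import torch
import torch.nn as nn


class GRULanguageModel(nn.Module):
    def __init__(self, vocab_size: int, dim: int = 512, num_layers: int = 2):
        super().__init__()
        self.embedding = nn.Embedding(vocab_size, dim)
        self.gru = nn.GRU(dim, dim, num_layers=num_layers, batch_first=True)
        self.decoder = nn.Linear(dim, vocab_size, bias=False)
        self.decoder.weight = self.embedding.weight  # tie input and output embeddings

    def forward(
        self, tokens: torch.Tensor, state: Optional[torch.Tensor] = None
    ) -> Tuple[torch.Tensor, torch.Tensor]:
        # tokens: (batch, time) token ids; state: (num_layers, batch, dim) recurrent state
        x = self.embedding(tokens)
        out, state = self.gru(x, state)
        return self.decoder(out), state  # logits: (batch, time, vocab_size)
```

Keeping the recurrent state explicit and typed as `Optional` also tends to help when exporting the model with TorchScript for the runtime fusion items above.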
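For the MWER item, a hedged sketch of the usual expected-word-error objective over an n-best list; whether the hypothesis scores already include the fused LM term is a design choice of this project, so treat the inputs and shapes as assumptions.

```python
# Illustrative MWER loss over an n-best list; inputs and shapes are assumptions.
import torch


def mwer_loss(hyp_log_probs: torch.Tensor, word_errors: torch.Tensor) -> torch.Tensor:
    # hyp_log_probs: (batch, nbest) total log-scores per hypothesis (optionally LM-fused)
    # word_errors:   (batch, nbest) word errors of each hypothesis against the reference
    probs = torch.softmax(hyp_log_probs, dim=-1)                     # renormalise over the n-best list
    relative = word_errors - word_errors.mean(dim=-1, keepdim=True)  # baseline for variance reduction
    return (probs * relative).sum(dim=-1).mean()                     # expected relative word error
```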
- Deep Speech: Scaling up end-to-end speech recognition
- End-to-End Attention-Based Large Vocabulary Speech Recognition
- On Using Monolingual Corpora in Neural Machine Translation
- First-Pass Large Vocabulary Continuous Speech Recognition using Bi-Directional Recurrent DNNs
- Towards better decoding and language model integration in sequence to sequence models
- Efficient softmax approximation for GPUs
- Using the Output Embedding to Improve Language Models
- Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition
- etc