Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Random notes #1

Open
hypnopump opened this issue Aug 11, 2021 · 5 comments
Open

Random notes #1

hypnopump opened this issue Aug 11, 2021 · 5 comments

Comments

@ClashLuke
Copy link

You could also check out uRNN, OSiPaRC and https://arxiv.org/abs/1909.00021

@hypnopump
Copy link
Owner Author

oh cool! Added urnn and delayed to the list. Could you point me to an OSiPaRC reference?

@ClashLuke
Copy link

OSiPaRC unfortunately doesnt have a paper. It's a more powerful and general uRNN implementation with constant memory consumption.

@hypnopump
Copy link
Owner Author

oh, you mean this? https://github.com/HomebrewNLP/HomebrewNLP/blob/b7d1975c31568d0ade81df5e415a5a4fa755ef6b/module.py#L117 ? have you measured execution performance wrt a classic LSTM?

RGN2 models are a bit slow to train already...

@ClashLuke
Copy link

Exactly, yeah. I dont recall how it compares to LSTM, but uRNN claims to outperform LSTM and this is basically a more powerful uRNN with RevNet bolted onto it.
I'd expect it to have a worse step time but overall faster convergence, but we definitely need to test that.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants