[WIP/not-for-merge] Initial resnet implementation#1613
Closed
danpovey wants to merge 14 commits intokaldi-asr:kaldi_52from
Closed
[WIP/not-for-merge] Initial resnet implementation#1613danpovey wants to merge 14 commits intokaldi-asr:kaldi_52from
danpovey wants to merge 14 commits intokaldi-asr:kaldi_52from
Conversation
…simplify resnet config generation; more examples.
Contributor
Author
|
@freewym, actually maybe you can start from this point. I'm running it in /home/dpovey/kaldi-chain/egs/cifar/v1; it's not finished running yet, but it's the cleanest version of ResNet (i.e. the closest to published setups). |
…elated options in training; slight refactoring of how shrinkage is applied in all scripts.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
This is an implementation of ResNets. They do seem helpful for CIFAR, e.g. we're getting 91% accuracy, although based on the papers we're referring to we should be getting better accuracy, around 95%.
I made it a bit more general than it needed to be, with some features that I now realize are not helpful, so I'm going to modify and simplify the implementation before this is ready for check in. I am putting up this PR here just in case collorabors like @hhadian want to have a look.
This PR also contains some stuff that didn't really work regarding image augmentation in RGB space.
@freewym, you probably shouldn't use this, I'm working today on a more definitive implementation.