[RFC] Batch normalization + LeNet with batchnorm example #140
Conversation
- add LeNet + batchnorm example
When it comes to modules, I don't know if you'd be interested in what I have been playing with here: https://github.com/jgbos/KFuddles. I go a little further with the module to allow the macro
That is some really nice work! As much as I love the transparency of Knet, I would also like for it to offer that kind of higher-level interface, as most deep learning frameworks do. Maybe @denizyuret already has some plans for that, or would be interested in that sort of design discussion?
Yes, I think there is a case for building the weights and functions without high-level interfaces. But I also utilize the
Also, working with layer types, which can keep an internal state (as is needed, for instance, in batch normalization), makes for cleaner code where you do not have to pass parameters all around, as I do with
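To illustrate the point about internal state, here is a minimal sketch in plain Julia (hypothetical names, not Knet's or KFuddles' actual API) of a layer type that keeps its normalization statistics inside the layer, so the moments never have to be passed around explicitly:

```julia
# Minimal sketch of a stateful batchnorm layer (hypothetical names).
using Statistics

mutable struct BatchNormLayer
    gamma::Vector{Float64}         # learnable scale
    beta::Vector{Float64}          # learnable shift
    running_mean::Vector{Float64}  # internal state, updated during training
    running_var::Vector{Float64}   # internal state, updated during training
    momentum::Float64
    training::Bool
end

BatchNormLayer(n; momentum=0.1) =
    BatchNormLayer(ones(n), zeros(n), zeros(n), ones(n), momentum, true)

# x is a (features, batchsize) matrix; statistics are per feature (row).
function (bn::BatchNormLayer)(x; eps=1e-5)
    if bn.training
        mu = vec(mean(x, dims=2))
        v  = vec(var(x, dims=2, corrected=false))
        # update the internal state in place (exponential moving average)
        bn.running_mean .= (1 - bn.momentum) .* bn.running_mean .+ bn.momentum .* mu
        bn.running_var  .= (1 - bn.momentum) .* bn.running_var  .+ bn.momentum .* v
    else
        mu, v = bn.running_mean, bn.running_var
    end
    xhat = (x .- mu) ./ sqrt.(v .+ eps)
    return bn.gamma .* xhat .+ bn.beta
end
```

At test time the layer falls back to its stored running statistics instead of the batch statistics; that piece of state is exactly what makes a purely functional, stateless interface awkward for batch normalization.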
Yeah, batch normalization was the main reason I started my code, the same reason you built
I plan to work on a standard module interface à la #152 for 0.8.6. I will look at this in the context of the whole modular style.
I decided to work on better CUDNN integration for 0.8.6. In particular, the general CNN/RNN speed went up considerably. Please see #193 for @cgumeli's integration of the CUDNN batchnorm; we are still ironing out the interface. I have not benchmarked it, but I suspect it will beat manual implementations. We will also look at dropout and softmax from CUDNN and replace the existing implementations if they offer a significant performance boost.
I implemented both the version which keeps the moments of each batch in the training set and the one with an exponentially decaying average (now commented out), following some indications from the discussion in #139.
In my experiments on CPU with the LeNet example, the extra operations needed by batchnorm increase the computational time per epoch, with respect to the lenet.jl example, by
+160% for the decaying average
+130% for "keep each batch's moments"
With the decaying average we save on memory, which can have some impact on really large training sets with small batch sizes. Which one should I keep?
@denizyuret @ilkerkesen @AStupidBear @mambuDL any comments?
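To make the two options concrete, here is a minimal sketch in plain Julia (hypothetical names, not the actual code in this PR) of both moment-tracking schemes:

```julia
# Sketch of the two moment-tracking schemes discussed above (hypothetical names).
using Statistics

# (a) keep the moments of every batch seen during training, average at the end:
#     exact over the training set, but memory grows with the number of batches.
struct StoredMoments
    means::Vector{Vector{Float64}}
    vars::Vector{Vector{Float64}}
end
StoredMoments() = StoredMoments(Vector{Float64}[], Vector{Float64}[])

function update!(m::StoredMoments, x)
    push!(m.means, vec(mean(x, dims=2)))
    push!(m.vars,  vec(var(x, dims=2, corrected=false)))
end
test_moments(m::StoredMoments) = (mean(m.means), mean(m.vars))

# (b) exponentially decaying average: constant memory, approximate.
mutable struct DecayingMoments
    mean::Vector{Float64}
    var::Vector{Float64}
    momentum::Float64
end
DecayingMoments(n; momentum=0.1) = DecayingMoments(zeros(n), ones(n), momentum)

function update!(m::DecayingMoments, x)
    mu, v = vec(mean(x, dims=2)), vec(var(x, dims=2, corrected=false))
    m.mean .= (1 - m.momentum) .* m.mean .+ m.momentum .* mu
    m.var  .= (1 - m.momentum) .* m.var  .+ m.momentum .* v
end
test_moments(m::DecayingMoments) = (m.mean, m.var)
```

Scheme (a) needs memory proportional to the number of batches for the stored per-batch moments, while scheme (b) keeps only one mean and one variance vector per layer, which is where the memory saving mentioned above comes from.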
Exports the batchnorm function and the BatchMoments type. Fixes #139